Progesterone receptor-regulated gene expression and methods related thereto

ABSTRACT

Disclosed are expression profiles of genes that are regulated by progesterone receptors, and particularly by progesterone receptor isoforms PR-A and PR-B. Methods for using such genes to identifying progesterone receptor agonist and antagonist ligands are described. Also described are methods for identifying isoform-specific progesterone receptor ligands, for identifying tissue-specific progesterone receptor ligands, and for determining the profile of genes regulated by progesterone receptors in a breast tumor sample. In addition, pluralities of polynucleotides from genes that are regulated by progesterone receptors are disclosed, as are pluralities of antibodies that selectively bind to proteins encoded by such genes.

CROSS-REFERENCE TO RELATED APPLICATIONS

[0001] This application claims the benefit of priority under 35 U.S.C. §119(e) from U.S. Provisional Application Serial No. 60/214,870, filedJun. 28, 2000, entitled “Surrogate Gene Markers for Two DifferentProgesterone Receptor Isoforms in Breast Cancer, and Their Use to Screenfor Isoform-Selective Progestational Ligands”. The entire disclosure ofU.S. Provisional Application Serial No. 60/214,780 is incorporatedherein by reference.

FIELD OF THE INVENTION

[0002] This invention generally relates to expression profiles of genesthat are regulated by progesterone receptors, and particularly byprogesterone receptor isoforms PR-A and PR-B, and to the use of suchgenes in methods for identifying progesterone receptor agonist andantagonist ligands, including progesterone receptor isoform-specificligands and tissue-specific ligands. This invention also relates tomethods for determining the profile of genes regulated by progesteronereceptors in a tissue sample. In addition, pluralities ofpolynucleotides transcribed from genes that are regulated byprogesterone receptors are disclosed, as are pluralities of antibodiesthat selectively bind to proteins encoded by such genes.

BACKGROUND OF THE INVENTION

[0003] Progesterone is a natural reproductive hormone that targets thebreast, uterus, ovaries, brain, bone, blood vessels, immune system, etc.Progestational agents are widely used for oral contraception, menopausalhormone replacement therapy, and cancer treatments. Antiprogestins,which are synthetic ligands that antagonize the actions of progesterone,are in clinical trials for contraception, for induction of labor, and totreat endometriosis, breast cancers and meningiomas. The actions ofprogesterone are varied and tissue-specific. Even in the normal breastit can have diverse effects: depending on the physiological state of thewoman, progesterone can be proliferative, antiproliferative, ordifferentiative. Additionally, progesterone promotes the development ofbreast cancers and accelerates the growth of established breast cancers.For example, when used for hormone replacement therapy at menopause,progestins, which are synthetic progestational agents, increase the riskof breast cancer. Paradoxically, they are protective in the uterus andprevent endometrial cancers.

[0004] Progesterone, synthetic progestins, and antiprogestins allinitially work through the same molecular pathway. These are lowmolecular weight, lipid soluble “ligands”. They enter target cellspassively, and pass into the nucleus where they bind to progesteronereceptors (PRs). Ligand binding activates the PR proteins, which thendimerize, bind to DNA at the promoters of progesterone target genes, andeither up- or down-regulate transcription of these genes. There are twonatural isoforms of PR, the A- and B-receptors, also referred to hereinas PR-A and PR-B, respectively. The isoforms are derived from twodistinct promoters in the single PR gene and are translated fromseparate translation initiation start sites. PR-B receptors are 933amino acids in length, which is 164 amino acids longer at the N-terminusthan PR-A, and contain a unique transcriptional activation function,AF-3 (Sartorius et al., Mol. Endocrinol. 8, 1347-1360 (1994)).Downstream of the additional 164 amino acids of PR-B, the two PRs havethe identical primary amino acid content. However, despite this closeamino acid composition, the two receptors have dramatically differentabilities to activate transcription of progestin-responsive promoters inexperimental model systems (Sartorius et al., Mol. Endocrinol. 8,1347-1360 (1994); Meyer et al., J. Biol. Chem. 267, 10882-10887 (1992);Vegeto et al., Mol. Endocrinol. 7, 1244-1255 (1993); Tung et al., Mol.Endocrinol. 7, 1256-1265 (1993); Sartorius et al., J. Biol. Chem. 268,9262-9266 (1993)). Progestin agonist-liganded PR-B are strongertransactivators than PR-A, although there are cell-type andpromoter-dependent exceptions. The antiprogestin RU486 has mixedagonist/antagonist activity on PR-B but not PR-A. Instead, agonist orantagonist-liganded PR-A can dominantly inhibit PR-B and other membersof the steroid receptor family, including estrogen receptors (ERs).Thus, PR-A are more likely to be transcriptional repressors than PR-B.(Hovland et al., J Biol Chem 273, 5455-60(1998); Vegeto et al., Mol.Endocrinol. 7, 1244-1255 (1993); McDonnell et al., J. Biol. Chem. 269,11945-11949 (1994)).

[0005] Indirect data suggest that the two PR isoforms havephysiologically different functions. They are unequally expressed indifferent tissues and physiological states. For instance, increasingratios of PR-A to PR-B in the chick oviduct in late winter, or in aged,nonlaying hens, resulted in measurable decreases in PR functionalactivity (Boyd-Leinen et al., Endocrinology 111, 30-36 (1982); Spelsberget al., Endocrinology 107, 1234-44 (1980)). There are stage-specific andregion-specific variations in the PR-A:PR-B ratio in the developing ratbrain (Kato et al., J Steroid Biochem Mol Biol 47, 173-82 (1993)) andstudies in primates show that PR-B predominates in the estrogen treatedhypothalamus, while expression of the PR-A isoform predominates in thepituitary (Baez et al., J Biol Chem 262, 6582-8 (1987); Bethea et al.,Endocrinology 139, 677-87 (1998)). In the human endometrium, absolutelevels and the ratio of PR-A to PR-B vary extensively during themenstrual cycle (Mote et al., Hum Reprod 15 Suppl 3, 48-56 (2000); Moteet al., J Clin Endocrinol Metab 84, 2963-71 (1999); Mangal et al., JSteroid Biochem Mol Biol 63, 195-202 (1997); Feil et al., Endocrinology123, 2506-2513 (1988)). In addition, uncontrolled, is or over-expressedPR-B levels are associated with a highly malignant phenotype inendometrial, cervical and ovarian cancers (Farr et al., Mamm. Genome 4,577-584 (1993); Fujimoto et al., J Steroid Biochem Mol Biol 62, 449-54(1997)).

[0006] In the normal breast, progesterone is both proliferative anddifferentiative [reviewed in\(Clarke et al., Endocr. Rev. 11, 266-301(1990))]. Breast epithelium mitoses increase during the menstrual cycleand peak in the late luteal phase, coincident with high circulatinglevels of progesterone. Progesterone induces lobular-alveolar outgrowthduring each menstrual cycle and during pregnancy induces furtherlobular-alveolar development in preparation for the terminaldifferentiative event of lactation. PR null mice exhibit incompletemammary gland ductal branching and failure of lobulo-alveolardevelopment, as well as failure to ovulate and to exhibit sexualbehavior (Lydon et al., Genes Develop. 9, 2266-2278 (1995)).

[0007] Little is known about cyclic changes in PR-A and PR-B in thenormal human breast.

[0008] However, in the mouse mammary gland, evidence supports a criticaland unique role for each of the two PR isoforms. It has been reportedthat a 3:1 overexpression of PR-A over PR-B results in extensive mammarygland epithelial cell hyperplasia, excessive ductal branching, and adisorganized basement membrane; all features associated with neoplasia(Shyamala et al., Proc Natl Acad Sci U S A 95, 696-701 (1998)). Incontrast, when PR-B is overexpressed, ductal growth prematurely arrestsand inappropriate lobulo-alveolar formation is observed (Shyamala etal., Proc Natl Acad Sci U S A 97, 3044-9 (2000)). However, when the PR-Aisoform was selectively knocked out, leaving only PR-B, the mammarygland appeared to develop normally in response to estradiol andprogesterone. In contrast, decidualization of the endometrium and thenormal antiproliferative effect of progesterone in the uterus wereabsent (Mulac-Jericevic et al., Science 289, 1751-4 (2000)). Such dataindicate that PR-A and PR-B have different tissue-specific effects.

[0009] In human breast cancers the presence of PR in estrogen receptor(ER) positive tumors indicates that responsiveness to endocrinetherapies is likely, while absence of PR is associated with hormoneresistance thus, PR are routinely measured in breast cancers as a guideto treatment (Horwitz et al., Recent Prog. Horm. Res. 41, 249-316(1985); Horwitz et al., J Biol Chem 253, 8185-91 (1978); McGuire, Semin.Oncol. 5, 428-433 (1978)). PR are also direct targets of second-lineprogestin therapies in patients whose tumors have developed antiestrogenresistance (Kimmick et al., Cancer Treat Res 94, 231-54 (1998); Howellet al., Recent Results Cancer Res 152, 227-44 (1998)). Nothing is known,however, about the role of PR-A vs. PR-B in breast cancers. The PR-A toPR-B ratio was measured in 202 PR-positive human breast tumors (Grahamet al., Cancer Res. 55, 5063-5068 (1995)). The majority had PR-A to PR-Bratios greater than one, and 33% had 3.7 times or more PR-A than PR-B.The functional significance of this is unknown. In breast cancer celllines, overexpression of PR-A results in marked changes in morphologyand loss of adherent properties (McGowan et al., Mol Endocrinol 13,1657-71(1999)). Thus, overexpression of PR-A as seen in many breasttumors, may lead to suppression of PR-B, and may be associated with poorprognosis. However, there are no clinical data to support thisconjecture.

[0010] Prior to the present invention, few, if any, endogenous genesdifferentially regulated by PR-A vs. PR-B were known in breast cancersor any other tissues. An excess of PR-A enhances the expression of SOX4mRNA levels in breast cancer cells. Whether PR-B also regulates thisgene is unknown. SOX4 induces DNA bending. PR-A enhance expression ofthe mouse multiple drug resistance (mdr) 1b gene, important fordevelopment of drug resistance in tumors. Whether this gene is regulatedendogenously only by PR-A is unknown. To the present inventors'knowledge, no data on PR-B specific gene regulation in breast cancers(or any tissues) has been published prior to the present invention.Although certain of the genes listed in Table 8 below were previouslyknown to be progesterone regulated, the PR isoform specificity of thisregulation was not known.

[0011] Knowledge of the unique sets of genes that are selectivelyregulated by each PR isoform would serve as a surrogate marker for thepresence and function of PR-A vs. PR-B in various tissue types and invarious disease states. Furthermore, knowledge of such genes and theirpromoters, would serve as a tool for screening PR ligands, andparticularly, PR-A vs. PR-B selective ligands. However, defining whichsets of genes are uniquely regulated by one or the other PR isoform inbreast cancers was impossible in progesterone target tissues becauseboth PR-A and PR-B receptors are simultaneously present in thosetissues, and are simultaneously activated by progesterone treatment.

SUMMARY OF THE INVENTION

[0012] One embodiment of the present invention relates to a method toidentify agonist ligands of progesterone receptors. The method includesthe steps of: (a) contacting a progesterone receptor with a putativeagonist ligand, wherein the progesterone receptor is selected from thegroup consisting of progesterone receptor A (PR-A) and progesteronereceptor B (PR-B), under conditions wherein, in the absence of theputative agonist ligand, the progesterone receptor is not activated; (b)detecting expression of at least one gene that is regulated by theprogesterone receptor when the progesterone receptor is activated; and,(c) comparing the expression of the at least one gene in the presenceand in the absence of the putative agonist ligand, wherein detection ofregulation of the expression of the at least one gene in the mannerassociated with activation of the progesterone receptor as set forth in(b) indicates that the putative agonist ligand is a progesteronereceptor agonist.

[0013] In one aspect, detection of upregulation of expression of atleast one gene chosen from a gene in Table 1, or detection ofdownregulation of at least one gene chosen from a gene in Table 2, inthe presence of the putative agonist ligand, indicates that the putativeagonist ligand is a selective agonist of PR-A. In another aspect,detection of upregulation of expression of at least one gene chosen froma gene in Table 3, or detection of downregulation of at least one genechosen from a gene in Table 4, in the presence of the putative agonistligand, indicates that the putative agonist ligand is a selectiveagonist of PR-B.

[0014] Another embodiment of the present invention relates to a methodto identify antagonists of progesterone receptors. This method includesthe steps of: (a) contacting a progesterone receptor with a putativeantagonist ligand, wherein the progesterone receptor is selected fromthe group consisting of progesterone receptor A (PR-A) and progesteronereceptor B (PR-B), under conditions wherein, in the absence of theputative antagonist ligand, the progesterone receptor is activated; (b)detecting expression of at least one gene that is regulated by theprogesterone receptor when the progesterone receptor is activated; and,(c) comparing the expression of the at least one gene in the presenceand in the absence of the putative antagonist ligand, wherein detectionof inhibition or reversal of the regulation of expression of the atleast one gene as compared to the regulation of expression of the atleast one gene in the manner associated with activation of theprogesterone receptor as set forth in (b), indicates that the putativeantagonist ligand is a progesterone receptor antagonist. Theprogesterone receptor can be activated by contacting the receptor with acompound that activates the receptor, the step of contacting beingperformed prior to, simultaneously with, or after the step of contactingof (a).

[0015] In one aspect of this embodiment, detection of inhibition ofexpression or downregulated expression of at least one gene chosen froma gene in Table 1 in the presence of the putative antagonist ligand ascompared to the expression of the at least one gene in the presence ofthe compound that activates the progesterone receptor, or detection ofinhibition of expression or upregulation of expression of at least onegene chosen from a gene in Table 2 in the presence of the putativeantagonist ligand as compared to the expression of the at least one genein the presence of the compound that activates the progesteronereceptor, indicates that the putative antagonist ligand is a selectiveantagonist of PR-A. In another aspect, detection of inhibition ofexpression or downregulation of expression of at least one gene chosenfrom a gene in Table 3 in the presence of the putative antagonist ligandas compared to the expression of the at least one gene in the presenceof the compound that activates the progesterone receptor, or detectionof inhibition of expression or upregulation of expression of at leastone gene chosen from a gene in Table 4, in the presence of the putativeantagonist ligand as compared to the expression of the at least one genein the presence of the compound that activates the progesteronereceptor, indicates that the putative antagonist ligand is a selectiveantagonist of PR-B.

[0016] In each of the above-described methods, the at least one gene isselected from the group consisting of: (i) at least one gene that isselectively upregulated by PR-A chosen from a gene in Table 1; (ii) atleast one gene that is selectively downregulated by PR-A chosen from agene in Table 2; (iii) at least one gene that is selectively upregulatedby PR-B chosen from a gene in Table 3; (iv) at least one gene that isselectively downregulated by PR-B chosen from a gene in Table 4; (v) atleast one gene that is upregulated or downregulated by both PR-A andPR-B chosen from a gene in Table 5; (vi) at least one gene that isreciprocally regulated by PR-A and PR-B chosen from a gene in Table 6;and, (vii) at least one gene that is regulated by one of the PR-A or thePR-B, wherein regulation of the gene is altered when the other of thePR-A or PR-B is expressed by the same cell, chosen from a gene in Table7. In one embodiment, the method further includes a step of detectingexpression of at least one gene chosen from the genes in Table 8.

[0017] In one aspect, step (b) includes detecting expression of:11-beta-hydroxysteroid dehydrogenase type 2, tissue factor gene, PCIgene (plasminogen activator inhibitor 3), MAD-3 Ikβ-alpha, Niemann-PickC disease (NPC1), platelet-type phosphofructokinase, phenylethanolaminen-methyltransferase (PNMT), transforming growth factor-beta 3(TGF-beta3), Monocyte Chemotactic Protein 1, delta sleep inducingpeptide (related to TSC-22), and estrogen receptor-related protein(hERRa1). In another aspect, step (b) includes detecting expression of:growth arrest-specific protein (gas6), tissue factor gene, NF-IL6-beta(C/EBPbeta), PCI gene (plasminogen activator inhibitor), Stat5A,calcium-binding protein S100P, MSX-2, lipocortin II (calpactin I),selenium-binding protein (hSBP), and bullous pemphigoid antigen (plakinfamily). In another aspect, step (b) includes detecting expression ofphenylethanolamine n-methyltransferase (PNMT) adrenal medulla. Inanother aspect, step (b) includes detecting expression ofproteasome-like subunit MECL-1. In another aspect, step (b) includesdetecting expression of: growth arrest-specific protein and tissuefactor gene.

[0018] In each of the above-described methods, the progesterone receptorcan be PR-A, PR-B or both PR-A and PR-B.

[0019] In one aspect of the above-described methods, the step (b) ofdetecting comprises detecting expression of at least five genes from anyone or more of the Tables 1-7. In another aspect, the step (b) ofdetecting comprises detecting expression of at least ten genes from anyone or more of the Tables 1-7. In yet another aspect, the step (b) ofdetecting comprises detecting expression of at least 15 genes from anyone or more of the Tables 1-7.

[0020] In one aspect of the above-described methods, the progesteronereceptor is expressed by a cell. In this aspect, the progesteronereceptor is endogenously expressed by the cell or recombinantlyexpressed by the cell. In one embodiment, cell is part of a tissue froma test animal. In this embodiment, the step of contacting is performedby administration of the putative agonist ligand to the test animal orto the tissue of the test animal.

[0021] In another aspect of the above-described methods, expression ofthe at least one gene is detected by measuring amounts of transcripts ofthe at least one gene before and after contact of the progesteronereceptor with the putative agonist ligand. In one aspect, expression ofthe at least one gene is detected by detecting hybridization of at leasta portion of the at least one gene or a transcript thereof to a nucleicacid molecule comprising a portion of the at least one gene or atranscript thereof in a nucleic acid array. In another aspect,expression of the at least one gene is detected by measuring expressionof a reporter gene that is operatively linked to at least the regulatoryregion of the at least one gene. In another aspect, expression of the atleast one gene is detected by detecting the production of a proteinencoded by the at least one gene.

[0022] In yet another aspect of the above-described methods, theputative agonist ligand is a product of rational drug design.

[0023] Yet another embodiment of the present invention relates to amethod to identify isoform-specific agonists of progesterone receptors.This method includes the steps of: (a) contacting a progesteronereceptor with a putative agonist ligand, wherein the progesteronereceptor is selected from the group consisting of progesterone receptorA (PR-A) and progesterone receptor B (PR-B), under conditions wherein inthe absence of the putative agonist ligand, the progesterone receptor isnot activated; (b) detecting expression of at least one gene that isregulated by the progesterone receptor when the progesterone receptor isactivated; and, (c) comparing the expression of the at least one gene inthe presence and in the absence of the putative agonist ligand, whereindetection of regulation of the expression of the at least one gene inthe manner associated with activation of the progesterone receptor asset forth in (b)(i) but not (b)(ii), indicates that the putative agonistligand is a PR-A-specific agonist, and wherein detection of regulationof the expression of the at least one gene in the manner associated withactivation of the progesterone receptor as set forth in (b)(ii) but not(b)(i), indicates that the putative agonist ligand is a PR-B-specificagonist.

[0024] Another embodiment of the present invention relates to a methodto identify isoform-specific antagonists of progesterone receptors. Thismethod includes the steps of: (a) contacting a progesterone receptorwith a putative antagonist ligand, wherein the progesterone receptor isselected from the group consisting of progesterone receptor A (PR-A) andprogesterone receptor B (PR-B), under conditions wherein, in the absenceof the putative antagonist ligand, the progesterone receptor isactivated; (b) detecting expression of at least one gene that isregulated by the progesterone receptor when the progesterone receptor isactivated; and, (c) comparing the expression of the at least one gene inthe presence and in the absence of the putative antagonist ligand,wherein, in the presence of the putative antagonist ligand, detection ofinhibition or reversal of the regulation of expression of the at leastone gene as compared to the regulation of expression of the at least onegene in the manner associated with activation of the progesteronereceptor as set forth in (b)(i) but not (b)(ii), indicates that theputative antagonist ligand is a PR-A-specific antagonist, and wherein,in the presence of the putative antagonist ligand, detection ofinhibition or reversal of the regulation of expression of the at leastone gene as compared to the regulation of the expression of the at leastone gene in the manner associated with activation of the progesteronereceptor as set forth in (b)(ii) but not (b)(i), indicates that theputative antagonist ligand is a PR-B-specific antagonist.

[0025] In each of the above-described methods of identifying aisoform-specific regulator of progesterone receptors, the progesteronereceptor can include PR-A, PR-B, or both PR-A and PR-B. The at least onegene is selected from the group consisting of: (i) at least one genethat is exclusively upregulated or downregulated by PR-A, chosen from aTable selected from the group consisting of Table 1 and Table 2; and,(b) at least one gene that is exclusively upregulated or downregulatedby PR-B chosen from a Table selected from the group consisting of Table3 and Table 4. In one aspect, the step (b) of detecting comprisesdetecting expression of at least five genes from any one or more of theTables 1-4. In another aspect, the step (b) of detecting comprisesdetecting expression of at least ten genes from any one or more of theTables 1-4. In yet another aspect, the step (b) of detecting comprisesdetecting expression of at least 15 genes from any one or more of theTables 1-4.

[0026] Another embodiment of the present invention relates to a methodto identify a tissue-specific agonist of a progesterone receptor. Thisembodiment includes the steps of: (a) providing an expression profilefor at least one gene that is known to be regulated by a progesteronereceptor in both a first and second tissue type when the progesteronereceptor is activated, wherein the at least one gene is chosen from thegenes in any one or more of the genes in Tables 1-7; (b) contacting aprogesterone receptor expressed by a first tissue type with a putativeagonist ligand, wherein the progesterone receptor is selected from thegroup consisting of progesterone receptor A (PR-A) and progesteronereceptor B (PR-B), under conditions wherein, in the absence of theputative agonist ligand, the progesterone receptor is not activated; (c)contacting a progesterone receptor expressed by a second tissue typewith the putative agonist ligand under conditions wherein, in theabsence of the putative agonist ligand, the progesterone receptor is notactivated, wherein the progesterone receptor is the same isoform as theprogesterone receptor contacted in (b); (d) detecting expression of theat least one gene from (a); (e) comparing the expression of the at leastone gene in the presence and in the absence of the putative agonistligand in each of the first and second tissue types, wherein detectionof regulation of the expression of the at least one gene in one of thefirst or second tissue types in the manner associated with activation ofthe progesterone receptor as set forth in the expression profile of (a),and detection of inhibition of regulation or no regulation of the atleast one gene in the other of the first or second tissue types, ascompared to the expression of the at least one gene associated withactivation of the progesterone receptor as set forth in the expressionprofile of (a), indicates that the putative agonist ligand is atissue-specific progesterone receptor agonist.

[0027] Yet another embodiment relates to a method to identify atissue-specific antagonist of a progesterone receptor. This methodincludes the steps of: (a) providing an expression profile for at leastone gene that is known to be regulated by a progesterone receptor inboth a first and second tissue type when the progesterone receptor isactivated, wherein the at least one gene is chosen from the genes in anyone or more of the genes in Tables 1-7; (b) contacting a progesteronereceptor expressed by a first tissue type with a putative antagonistligand, wherein the progesterone receptor is selected from the groupconsisting of progesterone receptor A (PR-A) and progesterone receptor B(PR-B), under conditions wherein, in the absence of the putativeantagonist ligand, the progesterone receptor is activated; (c)contacting a progesterone receptor expressed by a second tissue typewith the putative antagonist ligand, wherein the progesterone receptoris selected from the group consisting of progesterone receptor A (PR-A)and progesterone receptor B (PR-B), under conditions wherein, in theabsence of the putative antagonist ligand, the progesterone receptor isactivated; (d) detecting expression of the at least one gene from (a);and, (e) comparing the expression of the at least one gene in thepresence and in the absence of the putative antagonist ligand in each ofthe first and second tissue types, wherein detection of regulation ofthe expression of the at least one gene in one of the first or secondtissue types in the manner associated with activation of theprogesterone receptor as set forth in the expression profile of (a) inthe presence of the putative antagonist ligand, and detection ofinhibition or reversal of regulation of expression of the at least onegene in the other of the first or second tissue types in the presence ofthe putative antagonist ligand, as compared to the expression of the atleast one gene associated with activation of the progesterone receptoras set forth in the expression profile of (a), indicates that theputative antagonist ligand is a tissue-specific progesterone receptorantagonist.

[0028] Another embodiment of the present invention relates to a methodto identify a tissue-specific agonist of a progesterone receptor. Thismethod includes the steps of: (a) providing an expression profile for atleast one gene that is known to be regulated by a progesterone receptorin a first tissue type but not a second tissue type when theprogesterone receptor is activated, wherein the at least one gene ischosen from the genes in any one or more of the genes in Tables 1-7; (b)contacting a progesterone receptor expressed by the first tissue typewith a putative agonist ligand, wherein the progesterone receptor isselected from the group consisting of progesterone receptor A (PR-A) andprogesterone receptor B (PR-B), under conditions wherein, in the absenceof the putative agonist ligand, the progesterone receptor is notactivated; (c) detecting expression of the at least one gene from (a);(d) comparing the expression of the at least one gene in the presenceand in the absence of the putative agonist ligand in the first tissuetype, wherein detection of regulation of the expression of the at leastone gene in the first tissue type in the manner associated withactivation of the progesterone receptor as set forth in the expressionprofile of (a) indicates that the putative agonist ligand is atissue-specific progesterone receptor agonist for the first tissue type.

[0029] Yet another embodiment of the present invention relates to amethod to identify a tissue-specific antagonist of a progesteronereceptor. This method includes the steps of: (a) providing an expressionprofile for at least one gene that is known to be regulated by aprogesterone receptor in a first but not in a second tissue type whenthe progesterone receptor is activated, wherein the at least one gene ischosen from the genes in any one or more of the genes in Tables 1-7; (b)contacting a progesterone receptor expressed by a first tissue type witha putative antagonist ligand, wherein the progesterone receptor isselected from the group consisting of progesterone receptor A (PR-A) andprogesterone receptor B (PR-B), under conditions wherein, in the absenceof the putative antagonist ligand, the progesterone receptor isactivated; (c) detecting expression of the at least one gene from (a);and, (d) comparing the expression of the at least one gene in thepresence and in the absence of the putative antagonist ligand in thefirst tissue type, wherein detection of inhibition or reversal ofregulation of expression of the at least one gene in the first tissuetype in the presence of the putative antagonist ligand, as compared tothe expression of the at least one gene associated with activation ofthe progesterone receptor as set forth in the expression profile of (a),indicates that the putative antagonist ligand is a tissue-specificprogesterone receptor antagonist of the first tissue type.

[0030] In each of the above-described methods to identify atissue-specific regulator of a progesterone receptor, in one aspect, thefirst tissue type is breast, and wherein the at least one gene isselected from the group consisting of: (i) at least one gene that isselectively upregulated by PR-A chosen from a gene in Table 1; (ii) atleast one gene that is selectively downregulated by PR-A chosen from agene in Table 2; (iii) at least one gene that is selectively upregulatedby PR-B chosen from a gene in Table 3; (iv) at least one gene that isselectively downregulated by PR-B chosen from a gene in Table 4; (v) atleast one gene that is upregulated or downregulated by both PR-A andPR-B chosen from a gene in Table 5; (vi) at least one gene that isreciprocally regulated by PR-A and PR-B chosen from a gene in Table 6;and, (vii) at least one gene that is regulated by one of the PR-A or thePR-B, wherein regulation of the gene is altered when the other of thePR-A or PR-B is expressed by the same cell, chosen from a gene in Table7. In one aspect, the second tissue type is selected from the groupconsisting of breast, uterus, bone, cartilage, cardiovascular tissues,heart, lung, brain, meninges, pituitary, ovary, oocyte, corpus luteum,oviduct, fallopian tubes, T lymphocytes, B lymphocytes, thymocytes,salivary gland, placenta, skin, gut, pancreas, liver, testis,epididymis, bladder, urinary tract, eye, and teeth. In one aspect, thefirst tissue type is a non-malignant tissue and wherein the secondtissue type is a malignant tissue from the same tissue source as thefirst tissue type. A preferred tissue source is breast tissue. Inanother aspect, the first tissue type is a normal tissue and wherein thesecond tissue type is a non-malignant, abnormal tissue.

[0031] In each of the above-described methods for identifying atissue-specific regulator of a progesterone receptor, the expressionprofile of genes regulated by a progesterone receptor in the first orsecond tissue type can be provided by a method comprising: (a) providinga first cell of a selected tissue type that expresses a progesteronereceptor A (PR-A) and not a progesterone receptor B (PR-B) and a secondcell of the same tissue type that expresses PR-B and not PR-A; (b)stimulating the progesterone receptors in (a) by contacting the firstand second cells with a progesterone receptor stimulatory ligand; (c)detecting expression of genes by the first and second cells in thepresence of the stimulatory ligand and in the absence of the stimulatoryligand, wherein a difference in the expression of a gene in the presenceof the stimulatory ligand as compared to in the absence of thestimulatory ligand, indicates that the gene is regulated by theprogesterone receptor in the selected tissue type.

[0032] Another embodiment of the present invention relates to method todetermine the profile of genes regulated by progesterone receptors in abreast tumor sample. This method includes the steps of: (a) obtainingfrom a patient a breast tumor sample; (b) detecting expression of atleast one gene in the breast tumor sample that is regulated by aprogesterone receptor when the progesterone receptor is activated; and,(c) producing a profile of genes for the tumor sample that are regulatedselectively by PR-A, selectively by PR-B, or by both PR-A and PR-B. Theat least one gene is selected from the group consisting of: (i) at leastone gene that is selectively upregulated by PR-A chosen from a gene inTable 9; (ii) at least one gene that is selectively downregulated byPR-A chosen from a gene in Table 10; (iii) at least one gene that isselectively upregulated by PR-B chosen from a gene in Table 11; (iv) atleast one gene that is selectively downregulated by PR-B chosen from agene in Table 12; (v) at least one gene that is upregulated ordownregulated by both PR-A and PR-B chosen from a gene in Table 13; (vi)at least one gene that is reciprocally regulated by PR-A and PR-B chosenfrom a gene in Table 14; and, (vii) at least one gene that is regulatedby one of the PR-A or the PR-B, wherein regulation of the gene isaltered when the other of the PR-A or PR-B is expressed by the samecell, chosen from a gene in Table 15.

[0033] Yet another embodiment of the present invention relates to aplurality of polynucleotides for the detection of the expression ofgenes regulated by progesterone receptors in breast tissue. Theplurality of polynucleotides consists of polynucleotide probes that arecomplementary to RNA transcripts, or nucleotides derived therefrom, ofgenes that are regulated by progesterone receptors. The plurality ofpolynucleotides also comprises polynucleotide probes that arecomplementary to RNA transcripts, or nucleotides derived therefrom, ofgenes selected from the group consisting of: (a) at least one gene thatis selectively upregulated by PR-A chosen from a gene in Table 1; (b) atleast one gene that is selectively downregulated by PR-A chosen from agene in Table 2; (c) at least one gene that is selectively upregulatedby PR-B chosen from a gene in Table 3; (d) at least one gene that isselectively downregulated by PR-B chosen from a gene in Table 4; (e) atleast one gene that is upregulated or downregulated by both PR-A andPR-B chosen from a gene in Table 5; (e) at least one gene that isreciprocally regulated by PR-A and PR-B chosen from a gene in Table 6;and, (f) at least one gene that is regulated by one of the PR-A or thePR-B, wherein regulation of the gene is altered when the other of thePR-A or PR-B is expressed by the same cell, chosen from a gene in Table7.

[0034] In one aspect, the polynucleotide probes are immobilized on asubstrate. In another aspect, the polynucleotide probes are hybridizablearray elements in a microarray. In another aspect, the polynucleotideprobes are conjugated to detectable markers. In yet another aspect, theplurality of polynucleotides further comprises at least onepolynucleotide probe that is complementary to RNA transcripts, ornucleotides derived therefrom, of at least one gene chosen from thegenes in Table 8.

[0035] Another embodiment of the present invention relates to aplurality of antibodies, or antigen binding fragments thereof, for thedetection of the expression of genes regulated by progesterone receptorsin breast tissue. The plurality of antibodies, or antigen bindingfragments thereof, consists of antibodies, or antigen binding fragmentsthereof, that selectively bind to proteins encoded by genes that areregulated by progesterone receptors. The plurality of antibodies, orantigen binding fragments thereof, also comprises antibodies, or antigenbinding fragments thereof, that selectively bind to proteins encoded bygenes selected from the group consisting of: (a) at least one gene thatis selectively upregulated by PR-A chosen from a gene in Table 1; (b) atleast one gene that is selectively down-regulated by PR-A chosen from agene in Table 2; (c) at least one gene that is selectively upregulatedby PR-B chosen from a gene in Table 3; (d) at least one gene that isselectively downregulated by PR-B chosen from a gene in Table 4; (e) atleast one gene that is upregulated or downregulated by both PR-A andPR-B chosen from a gene in Table 5; (e) at least one gene that isreciprocally regulated by PR-A and PR-B chosen from a gene in Table 6;and, (f) at least one gene that is regulated by one of the PR-A or thePR-B, wherein regulation of the gene is altered when the other of thePR-A or PR-B is expressed by the same cell, chosen from a gene in Table7.

[0036] In one aspect, the plurality of antibodies, or antigen bindingfragments thereof, further comprises at least one antibody, or anantigen binding fragment thereof, that selectively binds to a proteinencoded by a gene chosen from the genes in Table 8.

[0037] Another embodiment of the present invention relates to a methodto identify genes that are regulated by a progesterone receptor in twoor more tissue types. This method includes the steps of: (a) activatinga progesterone receptor in two or more tissue types that express theprogesterone receptor; (b) detecting expression of at least one gene thetwo or more tissue types, the at least one gene being chosen from a genein any one or more of Tables 1-7, and, (c) identifying genes that areregulated by the progesterone receptor in each of the two or more tissuetypes. This method can further include the step of detecting whether thegenes are regulated selectively by PR-A, selectively by PR-B, or by bothPR-A and PR-B.

[0038] Another embodiment of the present invention relates to a methodto regulate the expression of a gene selected from the group consistingof any one or more of the genes in Tables 1-7. The method includesadministering to a cell that expresses a progesterone receptor acompound selected from the group consisting of: progesterone, aprogestin, and an antiprogestin, wherein the compound is effective toregulate the expression of the gene. In one embodiment, the gene isselected from the group consisting of: growth arrest-specific protein(gas6), NF-IL6-beta (C/EBPbeta), calcium-binding protein S100P, MSX-2,selenium-binding protein (hSBP), and bullous pemphigoid antigen (plakinfamily). In another embodiment, the cell that expresses a progesteronereceptor is in the breast tissue of a patient that has, or is at risk ofdeveloping, breast cancer.

DETAILED DESCRIPTION OF THE INVENTION

[0039] The present invention generally relates to the identification ofa large number of genes that are regulated by progesterone receptors,and particularly, to the identification of how these genes are regulatedby the progesterone receptor isoforms, PR-A and PR-B. Using the geneexpression profiles disclosed herein, one can identify novel ligands ofprogesterone receptors (both progestin-like agonists andanti-progestin-like antagonists) that regulate progesterone receptors,including in an isoform-specific and/or tissue specific manner. Inaddition, these genes can be used to profile individuals that have beendiagnosed with breast cancer to enhance the ability of the clinician todevelop a prognosis and treatment protocols for the individual patient.The genes can also be used to profile the progesterone receptorregulated gene expression in tissue types other than breast tissue.Moreover, given the knowledge of these genes, one can produce novelcombinations of polynucleotides and/or antibodies and/or peptides foruse in progestational drug screening assays or expression profiling ofpatient samples.

[0040] The present inventors have generated model systems to study PRsin breast cancer cells, that are unique to the present inventors'laboratory. In most target tissues, including the breast and uterus, PRsare induced by estradiol. Thus, one can only study progestin actions inthe background of an estrogenized system. This makes it virtuallyimpossible to dissect out responses that are due to progesterone, fromthose that are due to estrogens. Furthermore, all these target tissuescontain both PR-A and PR-B. This makes it impossible to dissect out theeffects of each PR isoform independently. The T47Dco breast cancer cellsare unique to the present inventors' laboratory. They have retained PRand express both PR-A and PR-B at equal levels (Horwitz et al., Cell 28,633-42 (1982)). However, the PRs in these cells are constitutivelyregulated without estrogens. In order to study differential generegulation by the two PR isoforms independently, the present inventorsconstructed a model system in which a PR-negative subline (termedT47D-Y), was derived from T47Dco breast cancer cells. T47D-4 cells werethen engineered to stably express either PR-B (termed T47D-4B) or PR-A(termed T47D-4A) at equal levels to each other and to the parentalT47Dco cells (Sartorius et al., Cancer Res. 54, 3668-3877 (1994)). Thepresent inventors have now used these three new cell lines to analyzeprogesterone-responsive gene regulation via PR-B or PR-A (with PRnegative T47D-Y cells serving as a control) using Affymetrix™ microarrayHFL6800 gene expression chips and Atlas™ Human cDNA Expression Arrays.In addition to confirming the regulation of the few knownprogesterone-responsive genes, the present inventors have identifiedmany genes not previously known to be regulated by PR. Importantly, theresults described herein now allow discrimination of genes that areregulated uniquely by PR-B from genes that are uniquely regulated byPR-A. It was found that PR-B regulate more genes than PR-A in responseto progesterone, but that a number of genes are uniquely regulated byPR-A. Many of these results have been confirmed by northern blotanalysis or RT-PCR of the gene transcripts, or by western blot analysesof the protein products. The data presented herein demonstrate that thetwo PR isoforms do indeed have unique roles in gene regulation in breastcancer cells. Lastly, the present inventors have observed that theexpression levels of a subset of genes are modified by the presence ofPR in a ligand-independent fashion.

[0041] Genes Regulated by Progesterone Receptors:

[0042] Of the more than 6000 human genes screened, the present inventorshave identified multiple genes, the expression of which is regulated byprogesterone receptors. The genes can be grouped into categories basedon the regulation of expression of the genes by the progesteronereceptor isoforms, PR-A and PR-B. More particularly, the genes have beengrouped into the following main categories: (1) Genes that areselectively (i.e., exclusively or uniquely) upregulated by PR-A (Tables1 and 9); (2) genes that are selectively downregulated by PR-A (Tables 2and 10); (3) genes that are selectively upregulated by PR-B (Tables 3and 11); (4) genes that are selectively downregulated by PR-B (Tables 4and 12); (5) genes that are upregulated or downregulated in the samedirection by both PR-A and PR-B (Tables 5 and 13); (6) genes that arereciprocally regulated by PR-A and PR-B (Tables 6 and 14); and (7) genesthat are regulated by one of the isoforms, wherein such regulation isaltered when the other isoform is present (e.g., the expression of thegene is either up- or downregulated in the presence of both receptorsrelative to the expression level of the gene in the presence of only onereceptor) (Tables 7 and 15). In this last category, the gene ischaracterized in that the regulation of expression of the gene by oneisoform is altered or suppressed by the presence of the other isoform.It is noted that genes in this last category can also fall within one ofthe other 6 categories. Tables 1-7 include all genes that were newlydiscovered to be regulated by progesterone receptors by the presentinventors. Tables 9-15 include all of the genes from Tables 1-7,respectively, and additionally include the genes that were identified bythe present inventors that had previously been identified to beregulated generally by progesterone. This particular subset of genes(i.e., previously known to be regulated by progesterone) is also setforth separately in Table 8. It is noted that even though the genes inTable 8 were known to be regulated by progesterone, the isoformspecificity of these genes was not previously known. Therefore, theidentification of the PR isoform regulation of the genes in Table 8 isnovel. Other categories of the genes identified in the present inventionare as follows: Table 16 is a list of genes identified in the presentinvention which were previously known to be involved in breast cancer orin the development of mammary tissue. Table 17 is a list thatcategorizes the genes shown to be regulated by progesterone by thepresent inventors into functional categories based on GeneCardinformation as well as extensive literature reviews of each geneproduct. Table 18 (See Example 1) shows the cumulative results of thegene array analysis with regard to the PR-B-expressing cells describedin the Examples. Table 19 (See Example 1) shows the cumulative resultsof the gene array analysis with regard to the PR-A-expressing cellsdescribed in the Examples.

[0043] Accordingly, in one embodiment of the present invention, thegenes identified as being regulated by progesterone receptors by thepresent inventors can be used as endpoints or markers in a method toidentify ligands that regulate progesterone receptor activity. Accordingto the present invention, in general, the biological activity orbiological action of a protein such as a progesterone receptor refers toany function(s) exhibited or performed by the protein that is ascribedto the naturally occurring form of the protein as measured or observedin vivo (i.e., in the natural physiological environment of the protein)or in vitro (i.e., under laboratory conditions). In particular, thebiological activity of a progesterone receptor that is of interestherein includes the effect of the receptor, particularly when thereceptor is activated, on the expression of the downstream genesidentified in the present invention. According to the present invention,a “downstream gene” or “endpoint gene” is any gene, the expression ofwhich is regulated (up or down) by a progesterone receptor (PR-A and/orPR-B). The expression of the gene is typically regulated by theprogesterone receptor when it is activated, although the expression ofthe gene may be regulated by the progesterone receptor in the absence ofa stimulatory compound (i.e., the regulation may be ligand independent,or constitutive). Pharmaceutical companies are keenly interested inscreening their vast libraries of chemical compounds for ones that bindto (ligands), and either activate or inhibit, progesterone receptors.Selected sets of one, two, three, or more of the genes (up to the numberequivalent to all of the genes) of this invention can be used asend-points for rapid through-put screening of ligands that specificallyand selectively influence the activity of PR-A and/or PR-B. The ligandscan be either agonists or antagonists of the progesterone receptor.

[0044] As used herein, the phrase “PR agonist ligand” or “PR agonist”refers to any compound that interacts with a PR and elicits anobservable response. More particularly, a PR agonist can include, but isnot limited to, steroidal or non-steroidal compounds; a protein,peptide, or nucleic acid that selectively binds to and activates orincreases the activation of a progesterone receptor; and most commonlyincludes progesterone, progesterone analogs, and any suitable product ofdrug design (e.g., a mimetic of progesterone, or a synthetic progestin)which is characterized by its ability to agonize (e.g., stimulate,induce, increase, enhance) the biological activity of a naturallyoccurring progesterone receptor in a manner similar to the naturalagonist, progesterone (e.g., by interaction/binding with and/or director indirect activation of a progesterone receptor). It is noted that theterm “progestin” as used herein is generally intended to includeprogesterone as well as any progesterone analog, such as a syntheticprogestin. Since the progesterone receptor is an intracellular receptor,a suitable agonist typically does not include an antibody or antigenbinding fragment thereof, but to the extent that an antibody thatselectively binds to and activates or increases the activation of aprogesterone receptor can be designed and implemented as an agonist,such a compound is also contemplated. It is noted that the effect of theaction of a given PR agonist on the expression of a downstream gene maybe the downregulation of the gene or the suppression of the expressionof a gene (e.g., when both isoforms of PR are present). Moreover, theaction of the agonist on a PR may have undesirable consequences in onetissue type and beneficial consequences in another tissue type. However,the term agonist is intended to refer to the ability of the ligand toact on a progesterone receptor in a manner that is substantially similarto the action of the natural PR ligand, progesterone, on theprogesterone receptor (described in more detail below). Typically, a PRagonist is identified under conditions wherein, in the absence of theagonist, the PR receptor is not activated, or is at least believed notto be in the presence of a compound that is known to activate thereceptor, such as the natural ligand progesterone or a known progestin.

[0045] The phrase, “PR antagonist ligand” or “PR antagonist” refers toany compound which inhibits the effect of a PR agonist, as describedabove. More particularly, a PR antagonist is capable of associating witha progesterone receptor such that the biological activity of thereceptor is decreased (e.g., reduced, inhibited, blocked, reversed,altered) in a manner that is antagonistic (e.g., against, a reversal of,contrary to) to the action of the natural agonist, progesterone, on thereceptor. Such a compound can include, but is not limited to, steroidalor non-steroidal compounds; a protein, peptide, or nucleic acid thatselectively binds to and blocks access to the receptor by a natural orsynthetic agonist ligand or reduces or inhibits the activity of aprogesterone receptor; or a product of drug design that blocks thereceptor or alters the biological activity of the receptor (e.g., anantiprogestin, which antagonizes the actions of progesterone). Again,since the progesterone receptor is an intracellular receptor, antibodyantagonists are typically not practical, although if appropriate andfeasible, their use is contemplated herein. It is noted that the actionof a given PR antagonist on a given downstream gene via a PR may be toactually upregulate the gene. Moreover, the action of the antagonist ona PR may have undesirable consequences in one tissue type and beneficialconsequences in another tissue type. However, the term antagonist isintended to refer to the ability of the ligand to act on a progesteronereceptor in a manner that is antagonistic to the action of the naturalPR ligand, progesterone, or a synthetic PR agonist, on the progesteronereceptor. Typically, an antagonist is identified under controlconditions wherein, in the absence of the antagonist, the progesteronereceptor is stimulated, such as by the natural ligand, progesterone, orby any suitable progestin. In one embodiment, a PR antagonist can beidentified by its ability to alter the regulation of downstream genes bythe receptor in the absence of a known stimulator of the receptor. Inthis embodiment, ligand-independent regulators of progesterone receptorfunction can be identified by detecting effects on genes that areconstitutively regulated by PR in the ligand-unactivated state.

[0046] According to the present invention, agonists and antagonistligands can include any regulatory ligand or compound that has theabove-mentioned characteristics with regard to regulation of aprogesterone receptor. For example, agonists and antagonists can includesteroidal and non-steroidal compounds, proteins and peptides, nucleicacid molecules, antibodies, and/or mimetics (e.g., products of drugdesign or combinatorial chemistry).

[0047] Natural sex steroid hormone agonists are low molecular weightringed cyclopentanophenanthrene compounds that in mammals includeprogesterone, estrogens and androgens. Steroid agonists can be extractedfrom a variety of natural sources, including the ovaries and testes.With the aim of enhancing the properties of natural steroid compounds,researchers have modified the cyclopentanophenanthrene structures and/oraltered the substituent side-chains to generate semi-synthetic andsynthetic steroidal and non-steroidal compounds. Non-steroidal compoundslack the classical cyclopentanophenanthrene structure. Nevertheless, allof these compounds—natural, semi-synthetic and synthetic, steroidal andnon-steroidal compounds, bind to their respective nuclear receptors.Modified compounds can be either agonists or antagonists.

[0048] Progesterone is the natural “progestin” produced by the ovariesand adrenal glands of mammals. Semi-synthetic or synthetic analogs thathave progesterone-like effects, can be either steroidal ornon-steroidal. They are also included in the generic category called“progestins.” Natural, semi-synthetic or synthetic progestins bind tointracellular, usually intranuclear, progesterone receptors. Suchprogestins can be either “agonists” or “antagonists” (antiprogestins).Both agonists and antagonists can have variable levels of activity ofthe receptors. An agonist can be strong or weak with many levels inbetween. An antagonist can also be strong or weak. Some antagonists mayhave “mixed” agonist/antagonist properties. The present invention canscreen for all of these types of progestins.

[0049] Other compounds in addition to steroidal and non-steroidalcompounds can bind progesterone receptors. These include proteins andpeptides, and nucleic acids and fragments thereof. Any compound thatbinds a receptor can be classified as a “ligand” of the receptor. If theligand influences the activity of the progesterone receptor, the presentinvention can be used to screen for such ligand(s).

[0050] An isolated protein, according to the present invention, is aprotein (including a peptide) that has been removed from its naturalmilieu (i.e., that has been subject to human manipulation) and caninclude purified proteins, partially purified proteins, recombinantlyproduced proteins, and synthetically produced proteins, for example. Assuch, “isolated” does not reflect the extent to which the protein hasbeen purified. An isolated protein useful as an antagonist or agonistaccording to the present invention can be isolated from its naturalsource, produced recombinantly or produced synthetically. Smallerpeptides useful as regulatory ligands are typically producedsynthetically by methods well known to those of skill in the art.Regulatory ligands of the present invention can also include an antibodyor antigen binding fragment that selectively binds to a progesteronereceptor.

[0051] According to the present invention, the phrase “selectively bindsto” refers to the ability of an antibody, antigen binding fragment orother binding partner (protein, peptide, nucleic acid) to preferentiallybind to specified proteins. More specifically, the phrase “selectivelybinds” refers to the specific binding of one protein to another, whereinthe level of binding, as measured by any standard assay, isstatistically significantly higher than the background control for theassay.

[0052] Agonists and antagonists that are products of drug design can beproduced using various methods known in the art. Various methods of drugdesign, useful to design mimetics or other regulatory compounds usefulin the present invention are disclosed in Maulik et al., 1997, MolecularBiotechnology: Therapeutic Applications and Strategies, Wiley-Liss,Inc., which is incorporated herein by reference in its entirety. A PRagonist or antagonist can be obtained, for example, from moleculardiversity strategies (a combination of related strategies allowing therapid construction of large, chemically diverse molecule libraries),libraries of natural or synthetic compounds, in particular from chemicalor combinatorial libraries (i.e., libraries of compounds that differ insequence or size but that have the similar building blocks) or byrational, directed or random drug design. See for example, Maulik etal., supra.

[0053] In a molecular diversity strategy, large compound libraries aresynthesized, for example, from peptides, oligonucleotides, natural orsynthetic steroidal compounds, carbohydrates and/or natural or syntheticorganic and non-steroidal molecules, using biological, enzymatic and/orchemical approaches. The critical parameters in developing a moleculardiversity strategy include subunit diversity, molecular size, andlibrary diversity. The general goal of screening such libraries is toutilize sequential application of combinatorial selection to obtainhigh-affinity ligands for a desired target, and then to optimize thelead molecules by either random or directed design strategies. Methodsof molecular diversity are described in detail in Maulik, et al., ibid.

[0054] As used herein, the term “mimetic” is used to refer to anynatural or synthetic steroidal compound, peptide, oligonucleotide,carbohydrate and/or natural or synthetic organic and non-steroidalmolecule that is able to mimic the biological action of a naturallyoccurring or known synthetic progestin.

[0055] Methods and Products of the Present Invention:

[0056] One embodiment of the present invention relates to a method toidentify agonist ligands of progesterone receptors. This method includesthe steps of: (a) contacting a progesterone receptor with a putativeagonist ligand, wherein the progesterone receptor is selected from thegroup consisting of progesterone receptor A (PR-A) and progesteronereceptor B (PR-B), under conditions wherein, in the absence of theputative agonist ligand, the progesterone receptor is not activated; (b)detecting expression of at least one gene that is regulated by theprogesterone receptor when the progesterone receptor is activated and,(c) comparing the expression of the at least one gene in the presenceand in the absence of the putative agonist ligand, wherein detection ofregulation of the expression of the at least one gene in the mannerassociated with activation of the progesterone receptor as set forth in(b) indicates that the putative agonist ligand is a progesteronereceptor agonist. The gene can include any one or more of any of thefollowing genes: (i) one or more of the genes that are selectivelyupregulated by PR-A chosen from a gene in Table 1; (ii) one or more ofthe genes that are selectively downregulated by PR-A chosen from a genein Table 2; (iii) one or more of the genes that are selectivelyupregulated by PR-B chosen from a gene in Table 3; (iv) one or more ofthe genes that are selectively downregulated by PR-B chosen from a genein Table 4; (v) one or more of the genes that are upregulated ordownregulated in the same direction by both PR-A and PR-B chosen from agene in Table 5; (vi) one or more of the genes that are reciprocallyregulated by PR-A and PR-B chosen from a gene in Table 6; and, (vii) oneor more of the genes that are regulated by one of either PR-A or PR-B,wherein the regulation of the gene is altered when the other of the PR-Aor PR-B is present, such a gene being chosen from a gene in Table 7.

[0057] Another embodiment of the present invention relates to a methodto identify antagonists of progesterone receptor. This method includesthe steps of: (a) contacting a progesterone receptor with a putativeantagonist ligand, wherein the progesterone receptor is selected fromthe group consisting of progesterone receptor A (PR-A) and progesteronereceptor B (PR-B), under conditions wherein, in the absence of saidputative antagonist ligand, said progesterone receptor is activated(i.e., before, simultaneously with or after the contact of the receptorwith the putative regulatory ligand); (b) detecting expression of atleast one gene that is regulated by the progesterone receptor when theprogesterone receptor is activated; and, (c) comparing the expression ofthe at least one gene in the presence and in the absence of the putativeantagonist ligand. Detection of inhibition or reversal of the regulationof expression of the at least one gene as compared to the regulation ofexpression of the at least one gene in the manner associated withactivation of the progesterone receptor as set forth in (b), indicatesthat the putative antagonist ligand is a progesterone receptorantagonist. The gene(s) to be detected in step (b) are chosen from oneor more of the following genes: (i) one or more of the genes that areselectively upregulated by PR-A chosen from a gene in Table 1; (ii) oneor more of the genes that are selectively downregulated by PR-A chosenfrom a gene in Table 2; (iii) one or more of the genes that areselectively upregulated by PR-B chosen from a gene in Table 3; (iv) oneor more of the genes that are selectively downregulated by PR-B chosenfrom a gene in Table 4; (v) one or more of the genes that areupregulated or downregulated by both PR-A and PR-B chosen from a gene inTable 5; (vi) one or more of the genes that are reciprocally regulatedby PR-A and PR-B chosen from a gene in Table 6; and, (vii) one or moreof the genes that are regulated by one of the PR-A or the PR-B, whereinregulation of the gene is altered when the other of the PR-A or PR-B isexpressed by the same cell, chosen from a gene in Table 7. In oneembodiment, the progesterone receptor is activated by contacting thereceptor with a compound that activates the receptor, the step ofcontacting being performed prior to, simultaneously with, or after thestep of contacting of (a).

[0058] The steps of the method of the present invention will now bedescribed in some detail for these embodiments of the invention;however, this discussion generally applies to other methods ofidentifying various ligands of progesterone receptors as describedbelow.

[0059] As used herein, the term “putative regulatory compound” or“putative regulatory ligand” refers to compounds having an unknownregulatory activity, at least with respect to the ability of suchcompounds to regulate progesterone receptors as described herein.

[0060] In the method of identifying a regulatory ligand (i.e., anagonist or an antagonist) according to the present invention, the methodcan be a cell-based assay, or non-cell-based assay. In one embodiment,the progesterone receptor is expressed by a cell (i.e., a cell-basedassay). In another embodiment the progesterone receptor is in a celllysate, is in isolated cell nuclei, or is purified or produced free ofcells. The progesterone receptor can be a PR-A, a PR-B, or a combinationof PR-A and PR-B. One advantage of the present invention is that, giventhe knowledge of the isoform regulation of the various downstream genesdisclosed herein, one can screen for ligands of the progesteronereceptor, including screening for isoform specific ligands, using cellsthat express both receptors. Prior to the present invention, it wasimpossible to distinguish between the effects of one isoform or theother, because most cells express both isoforms.

[0061] In one embodiment, the conditions under which a receptoraccording to the present invention is contacted with a putativeregulatory ligand, such as by mixing; are conditions in which thereceptor is not stimulated (activated) if essentially no regulatoryligand is present. For example, such conditions include normal cultureconditions in the absence of a known stimulatory compound (a stimulatorycompound being, for example, the natural ligand for the receptor(progesterone), a stimulatory peptide, or other equivalent stimulus,such as a synthetic progestin). The putative regulatory ligand is thencontacted with the receptor. In this embodiment, the step of detectingis designed to indicate whether the putative regulatory ligand altersthe biological activity of the receptor as compared to in the absence ofthe putative regulatory ligand (i.e., the background level), asdetermined by the effects of the contact between the ligand and thereceptor on the expression of downstream genes as described herein.

[0062] In an alternate embodiment, the conditions under which aprogesterone receptor according to the present invention is contactedwith a putative regulatory ligand, such as by mixing, are conditions inwhich the receptor is normally stimulated (activated) if essentially noregulatory ligand is present. Such conditions can include, for example,contact of said receptor with a stimulator molecule (a stimulatorycompound being, e.g., the natural ligand for the receptor(progesterone), a stimulatory peptide, or other equivalent stimulus,such as a synthetic progestin) which binds to the receptor and causesthe receptor to become activated. In this embodiment, the putativeregulatory ligand can be contacted with the receptor prior to, orsimultaneously with, the contact of the receptor with the stimulatorycompound (e.g., to determine whether the putative regulatory ligandblocks or otherwise inhibits the stimulation of the progesteronereceptor by the stimulatory compound), or after contact of the receptorwith the stimulatory compound (e.g., to determine whether the putativeregulatory ligand downregulates, or reduces the activation of thereceptor).

[0063] The present methods involve contacting the progesterone receptorwith the ligand being tested for a sufficient time to allow forinteraction, activation or inhibition of the receptor by the ligand. Theperiod of contact with the ligand being tested can be varied dependingon the result being measured, and can be determined by one of skill inthe art. For example, for binding assays, a shorter time of contact withthe compound being tested is typically suitable, than when activation isassessed, and particularly, when the expression of downstream genes isassessed. The methods of the present invention detect the expression ofdownstream genes and therefore, the time of incubation is dependent uponthe time required to achieve expression of the downstream genes. Such atime period is typically at least 2 hours, and more preferably at least4 hours, and more preferably at least 6 hours, although the time can beextended, if necessary to detect expression of a selected downstreamgene. As used herein, the term “contact period” refers to the timeperiod during which the progesterone receptor is in contact with theligand being tested. The term “incubation period” refers to the entiretime during which the cells expressing the receptor, for example, areallowed to grow prior to evaluation, or the time during which genesaffected by activation of the progesterone receptor are allowed to beexpressed, and such time period can be inclusive of the contact period.Thus, the incubation period includes all of the contact period and mayinclude a further time period during which the compound being tested isnot present, or is no longer being supplied to the receptor, but duringwhich gene expression is continuing (in the case of a cell based assay)prior to scoring. The incubation time for growth of cells can vary butis sufficient to allow for the binding of the progesterone receptor, theactivation or inhibition of the receptor, and the effect on theexpression of the downstream genes regulated by the receptor. It will berecognized that shorter incubation times are preferable becausecompounds can be more rapidly screened.

[0064] In accordance with the present invention, a cell-based assay isconducted under conditions which are effective to screen for regulatorycompounds useful in the method of the present invention. Effectiveconditions include, but are not limited to, appropriate media,temperature, pH and oxygen conditions that permit the growth of the cellthat expresses the receptor. An appropriate, or effective, medium refersto any medium in which a cell that naturally or recombinantly expressesa progesterone receptor, when cultured, is capable of cell growth andexpression of the progesterone receptor. Such a medium is typically asolid or liquid medium comprising growth factors and assimilable carbon,nitrogen and phosphate sources, as well as appropriate salts, minerals,metals and other nutrients, such as vitamins. Culturing is carried outat a temperature, pH and oxygen content appropriate for the cell. Suchculturing conditions are within the expertise of one of ordinary skillin the art. Exemplary cells expressing progesterone receptors aredescribed in the Examples, and in detail in (Sartorius et al., CancerRes. 54, 3668-3877 (1994)).

[0065] Cells that are useful in the cell-based assays of the presentinvention include any cell that expresses a progesterone receptor of theisoform A, isoform B, or a combination of PR-A and PR-B. Such cellsinclude cells that naturally express progesterone receptors, or cellsthat express progesterone receptors by recombinant technology. Suchcells preferably include, but are not limited to mammalian cells, whichcan originate from the breast or any other tissue. For example, tissuescontaining cells that are known to express the progesterone receptornaturally include, but are not limited to, breast, uterus, bone,cartilage, cardiovascular tissues, heart, lung, brain, meninges,pituitary, ovary, oocyte, corpus luteum, oviduct, fallopian tubes, Tlymphocytes, B lymphocytes, thymocytes, salivary gland, placenta, skin,gut, pancreas, liver, testis, epididymis, bladder, urinary tract, eye,and teeth. Cells suitable for use in a cell-based assay include normalor malignant cells, as well as cells that are not malignant, but whichare abnormal, such as cells from a non-malignant tissue that isotherwise diseased (e.g., tissues from endometriosis and leiomyoma ofthe uterus, fibrocystic disease of the breast, polycystic ovary). Othersuitable cells are cells that express PR-A, PR-B, or both isoforms, as aresult of recombinant technology. Such cells were used to discover thePR downstream genes of the present invention and are described in detailin Sartorius et al. (Sartorius et al., Cancer Res. 54, 3668-3877(1994)). Other suitable cells are cells that express a PR-A and/or aPR-B transgene (i.e., cells isolated from a transgenic animal), or cellsthat have a germline deletion of one of the PR isoforms, but not theother (i.e., cells from a PR-A or PR-B knockout animal).

[0066] According to the present invention, the method includes the stepof detecting the expression of at least one, and preferably more thanone, of the downstream genes that have now been shown to be regulated byprogesterone receptors by the present inventors. As used herein, theterm “expression”, when used in connection with detecting the expressionof a downstream gene of the present invention, can refer to detectingtranscription of the gene and/or to detecting translation of the gene.To detect expression of a downstream gene refers to the act of activelydetermining whether a gene is expressed or not. This can includedetermining whether the gene expression is upregulated as compared to acontrol, downregulated as compared to a control, or unchanged ascompared to a control. Therefore, the step of detecting expression doesnot require that expression of the gene actually is upregulated ordownregulated, but rather, can also include detecting that theexpression of the gene has not changed (i.e., detecting no expression ofthe gene or no change in expression of the gene).

[0067] The present method includes the step of detecting the expressionof at least one gene that is regulated by a progesterone receptor whenthe receptor is activated, as set forth in detail above. In a preferredembodiment, the step of detecting includes detecting the expression ofat least 2 genes, and preferably at least 3 genes, and more preferablyat least 4 genes, and more preferably at least 5 genes, and morepreferably at least 6 genes, and more preferably at least 7 genes, andmore preferably at least 8 genes, and more preferably at least 9 genes,and more preferably at least 10 genes, and more preferably at least 11genes, and more preferably at least 12 genes, and more preferably atleast 13 genes, and more preferably at least 14 genes, and morepreferably at least 15 genes, and so on, in increments of one, up todetecting expression of all of the downstream genes disclosed herein.Analysis of a number of genes greater than 1 can be accomplishedsimultaneously, sequentially, or cumulatively.

[0068] In the method of identifying an agonist or an antagonist of aprogesterone receptor of the present invention, the gene(s) to bedetected are preferably selected from the genes described in any one ormore of Tables 1-7. These tables disclose genes that are regulated byprogesterone receptors, and particularly, these tables disclose themanner in which the genes are regulated by the PR isoforms when theprogesterone receptor is activated (i.e., by a stimulator of thereceptor). The genes to be detected can include one or more of: (1)genes that are selectively (i.e., exclusively or uniquely) upregulatedby PR-A (Table 1); (2) genes that are selectively downregulated by PR-A(Table 2); (3) genes that are selectively upregulated by PR-B (Table 3);(4) genes that are selectively downregulated by PR-B (Table 4); (5)genes that are upregulated or downregulated in the same direction byboth PR-A and PR-B (Table 5); (6) genes that are reciprocally regulatedby PR-A and PR-B (Table 6); and (7) genes that are regulated by one ofthe PR isoforms, wherein such regulation is altered when the other PRisoform is present (e.g., the expression of the gene is either up- ordownregulated in the presence of both receptors relative to theexpression level of the gene in the presence of only one receptor)(Table 7). In one embodiment, the method further includes the additionaldetection of the expression of one or more genes that were previouslyknown to be regulated by progesterone, but for which the PR isoformregulation was not known until the present invention. Such genes aredisclosed in Table 8.

[0069] It is to be understood that the organization of various genesinto the present tables is for purposes of clarity and identification ofvarious genes on the basis of the manner in which the gene is regulatedby a progesterone receptor isoform. The selection of genes to bedetected in any given method can include any one or more of the genes inany one or more of the Tables, and can include the detection of anycombination of two or more of the genes in any one or more of theTables. It is not mandatory that a given assay be restricted to thedetection of all of the various genes in a single table, or to one genein each table. In addition, with regard to Tables 1-7, it is believedthat these tables encompass genes that have been identified by thepresent inventors to be regulated by progesterone receptors, but whichhave not previously been described as being regulated by progesterone.However, in the event that one or more of the genes in Tables 1-7 isfound to have previously been known to be regulated by progesterone, theremoval of such gene from these tables and the placement of such geneinto Table 8, is explicitly contemplated. This rationale also applies tothe genes of Table 16, which are believed to include all of those genesidentified by the inventors that were previously known to be involved inbreast cancer or mammary development. It is expressly contemplated thatother genes from Tables 1-7 or 9-15 can be added to Table 16, ifrequired for accuracy. Tables 9-15 include all of the genes identifiedby the present inventors as being regulated by progesterone receptors(organized by isoform regulation, as for Tables 1-7), and, as discussedpreviously herein, include genes that were previously known to beregulated by progesterone.

[0070] Given the knowledge of the genes regulated by progesteronereceptors according to the present invention, one of skill in the artwill be able to select one or more genes to detect in a method of thepresent invention, and the selection of the one or more genes can bedetermined by different factors. For example, certain subsets of thegenes are useful for detecting genes regulated by PR-A exclusively(i.e., genes in Tables 1, 2, 9 and 10). Other subsets of genes areuseful for detecting genes regulated by PR-B exclusively (i.e., genes inTables 3, 4, 11 and 12). One of skill in the art may wish to detectgenes disclosed herein that are related to a particular function, to aparticular tissue-type, or that are associated (or likely to beassociated) with a particular disease or condition. One of skill in theart may also wish to select genes on the basis of the change inexpression level in the presence of progesterone (i.e., and thereforeactivation of the PR) as compared to in the absence of progesterone.

[0071] In one aspect of the methods of the present invention, the methodof the present invention includes detecting genes of the presentinvention that are related by function. For example, Table 17 provides alisting of the various genes identified by the present inventors,categorized by function. Therefore, one could screen functional sets ofgenes to make a specific determination about a given cell or tissue thatexpresses a progesterone receptor, or to identify a ligand that has anaction that might be correlated with a functional gene. For example, onecould use subsets of the disclosed genes to screen a tumor for thelikelihood that it will metastasize by screening the genes in the “celladhesion or cytoskeletal interaction” group of Table 17. Other uses forscreening functional groups will be apparent to those of skill in theart.

[0072] In another aspect, one could detect genes that are of interest ina particular tissue type. Examples of such genes are disclosed below inthe discussion regarding the identification of tissue-specific ligandsof progesterone receptors.

[0073] In another aspect, one could detect those genes that areassociated with a particular disease, such as breast cancer. Anexemplary grouping of genes that are regulated by progesterone receptors(as disclosed herein) and that were previously known to be involved inbreast cancer or mammary gland development, are shown in Table 16. Inone embodiment, one may be interested in detecting those genes listed inTable 16 which are not also listed in Table 8.

[0074] In another aspect, it may be desirable to select those genes fordetection that are particularly highly regulated by progesteronereceptors in that they display the largest increases or decreases inexpression levels in the presence of progesterone as compared to in theabsence of progesterone. The detection of such genes can be advantageousbecause the endpoint may be more clear and require less quantitation.The relative expression levels of the genes identified in the presentinvention are listed in the tables. In these tables, the fold increaseor decrease in expression of the gene upon treatment of the progesteronereceptor with progesterone for 6 hours is indicated. The fold increaseor decrease was made with respect to the background level of expressionof the gene, which in some cases, was undetectable (i.e., the gene wasnot detected at all in the absence of progesterone, but was detected inthe presence of progesterone). Therefore, in one embodiment, one ofskill in the art might choose to detect genes that exhibited a foldincrease above background of at least 2. In another embodiment, one ofskill in the art might choose to detect genes that exhibited a foldincrease or decrease above background of at least 3, and in anotherembodiment at least 4, and in another embodiment at least 5, and inanother embodiment at least 6, and in another embodiment at least 7, andin another embodiment at least 8, and in another embodiment at least 9,and in another embodiment at least 10 or higher fold changes. It isnoted that fold increases or decreases are not typically compared fromone gene to another, but with reference to the background level for thatparticular gene.

[0075] In order to determine whether a putative regulatory compound isan agonist or antagonist of PR as defined herein, it is necessary toknow how a given gene is regulated by the PR so that one can compare theresults in the presence and absence of the putative regulatory ligand tothe gene expression profile produced by an activated receptor. Thisallows the investigator to thereby detect whether the contact of thereceptor with the putative ligand results in a profile of geneexpression that is substantially similar to the profile of geneexpression of an activated PR (i.e., agonist action), or whether contactof the receptor with the putative ligand results in a profile of geneexpression that is an inhibition, or reversal, of the profile of geneexpression of an activated PR (i.e., antagonist action).

[0076] In one aspect of the method of the present invention, the step ofdetecting can include the detection of one or more reporter genes thatare linked to promoters of one or more downstream genes according to thepresent invention. In this embodiment, the transcriptional read-out canuse one, two or more promoters of any of the genes of this invention,linked to any of several reporter constructs, which are introduced intocells by any of several established transfection or infection methods,including, but not limited to, calcium phosphate transfection,transformation, electroporation, microinjection, lipofection,adsorption, infection (e.g., by a viral vector) and protoplast fusion.The cells can be naturally PR-positive (containing both PRs), or theycan stably or transiently express either one or both of the twoPR-isoforms. The cells can be exposed to the test ligands (i.e., theputative regulatory ligands) for different times and/or concentrations,and transcription of the PR-responsive promoter(s) of the downstreamgenes disclosed in this invention can be quantified.

[0077] In another aspect of this method of the present invention, cellsexpressing a PR as described above are exposed to the unknown testligands at various concentrations and for various periods of time. Thetranscriptional read-out can be expression of one, two or more of thegenes of this invention, which are endogenously regulated in the cells.Expression of their transcripts and/or proteins is measured by any of avariety of known methods in the art several of which are exemplified inthe Examples section. For RNA expression, methods include but are notlimited to: extraction of cellular mRNA and northern blotting usinglabeled probes that hybridize to transcripts encoding all or part of oneor more of the genes of this invention; amplification of mRNA expressedfrom one or more of the genes of this invention using gene-specificprimers and reverse transcriptase-polymerase chain reaction, followed byquantitative detection of the product by any of a variety of means;extraction of total RNA from the cells, which is then labeled and usedto probe cDNAs or oligonucleotides encoding all or part of thePR-responsive genes of this invention, arrayed on any of a variety ofsurfaces.

[0078] Methods to measure protein expression levels of selected genes ofthis invention, include, but are not limited to: western blotting,immunocytochemistry, flow cytometry or other immunologic-based assays;assays based on a property of the protein including but not limited toDNA binding, ligand binding, or interaction with other protein partners.

[0079] Nucleic acid arrays are particularly useful for detecting theexpression of the downstream genes of the present invention. Theproduction and application of high-density arrays in gene expressionmonitoring have been disclosed previously in, for example, WO 97/10365;WO 92/10588; U.S. Pat. No. 6,040,138; U.S. Pat. No. 5,445,934; orWO95/35505, all of which are incorporated herein by reference in theirentireties. Also for examples of arrays, see Hacia et al. (1996) NatureGenetics 14:441-447; Lockhart et al. (1996) Nature Biotechnol.14:1675-1680; and De Risi et al. (1996) Nature Genetics 14:457-460. Ingeneral, in an array, an oligonucleotide, a cDNA, or genomic DNA, thatis a portion of a known gene occupies a known location on a substrate. Anucleic acid target sample is hybridized with an array of sucholigonucleotides and then the amount of target nucleic acids hybridizedto each probe in the array is quantified. One preferred quantifyingmethod is to use confocal microscope and fluorescent labels. TheAffymetrix GeneChip™ Array system (Affymetrix, Santa Clara, Calif.) andthe Atlas™Human cDNA Expression Array system are particularly suitablefor quantifying the hybridization; however, it will be apparent to thoseof skill in the art that any similar systems or other effectivelyequivalent detection methods can also be used. The Examples sectiondescribes the use of these two different array systems. In aparticularly preferred embodiment, one can use the knowledge of thegenes described herein to design novel arrays of polynucleotides, cDNAsor genomic DNAs for screening methods described herein. Such novelpluralities of polynucleotides are contemplated to be a part of thepresent invention and are described in detail below.

[0080] Suitable nucleic acid samples for screening on an array containtranscripts of interest or nucleic acids derived from the transcripts ofinterest (i.e., transcripts derived from the PR-regulated genes of thepresent invention). As used herein, a nucleic acid derived from atranscript refers to a nucleic acid for whose synthesis the mRNAtranscript or a subsequence thereof has ultimately served as a template.Thus, a cDNA reverse transcribed from a transcript, an RNA transcribedfrom that cDNA, a DNA amplified from the cDNA, an RNA transcribed fromthe amplified DNA, etc., are all derived from the transcript anddetection of such derived products is indicative of the presence and/orabundance of the original transcript in a sample. Thus, suitable samplesinclude, but are not limited to, transcripts of the gene or genes, cDNAreverse transcribed from the transcript, cRNA transcribed from the cDNA,DNA amplified from the genes, RNA transcribed from amplified DNA, andthe like.

[0081] Preferably, the nucleic acids for screening are obtained from ahomogenate of cells or tissues or other biological samples. Preferably,such sample is a total RNA preparation of a biological sample. Morepreferably in some embodiments, such a nucleic acid sample is the totalmRNA isolated from a biological sample. Biological samples may be of anybiological tissue or fluid or cells from any organism. Frequently thesample will be a “clinical sample” which is a sample derived from apatient, such as a breast tumor sample from a patient. Typical clinicalsamples include, but are not limited to, sputum, blood, blood cells(e.g., white cells), tissue or fine needle biopsy samples, urine,peritoneal fluid, and pleural fluid, or cells therefrom. Biologicalsamples may also include sections of tissues, such as frozen sections orformalin fixed sections taken for histological purposes.

[0082] In one embodiment, it is desirable to amplify the nucleic acidsample prior to hybridization. One of skill in the art will appreciatethat whatever amplification method is used, if a quantitative result isdesired, care must be taken to use a method that maintains or controlsfor the relative frequencies of the amplified nucleic acids to achievequantitative amplification. Methods of “quantitative” amplification arewell known to those of skill in the art. For example, quantitative PCRinvolves simultaneously co-amplifying a known quantity of a controlsequence using the same primers. This provides an internal standard thatmay be used to calibrate the PCR reaction. The high-density array maythen include probes specific to the internal standard for quantificationof the amplified nucleic acid. Other suitable amplification methodsinclude, but are not limited to polymerase chain reaction (PCR) Innis,et al., PCR Protocols. A guide to Methods and Application. AcademicPress, Inc. San Diego, (1990)), ligase chain reaction (LCR) (see Wu andWallace, Genomics, 4: 560 (1989), Landegren, et al., Science, 241: 1077(1988) and Barringer, et al., Gene, 89: 117 (1990), transcriptionamplification (Kwoh, et al., Proc. Natl. Acad. Sci. USA, 86: 1173(1989)), and self-sustained sequence replication (Guatelli, et al, Proc.Nat. Acad. Sci. USA, 87: 1874 (1990)).

[0083] Nucleic acid hybridization simply involves contacting a probe andtarget nucleic acid under conditions where the probe and itscomplementary target can form stable hybrid duplexes throughcomplementary base pairing. As used herein, hybridization conditionsrefer to standard hybridization conditions under which nucleic acidmolecules are used to identify similar nucleic acid molecules. Suchstandard conditions are disclosed, for example, in Sambrook et al.,Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Labs Press,1989. Sambrook et al., ibid., is incorporated by reference herein in itsentirety (see specifically, pages 9.31-9.62). In addition, formulae tocalculate the appropriate hybridization and wash conditions to achievehybridization permitting varying degrees of mismatch of nucleotides aredisclosed, for example, in Meinkoth et al., 1984, Anal. Biochem. 138,267-284; Meinkoth et al., ibid., is incorporated by reference herein inits entirety. Nucleic acids that do not form hybrid duplexes are washedaway from the hybridized nucleic acids and the hybridized nucleic acidscan then be detected, typically through detection of an attacheddetectable label. It is generally recognized that nucleic acids aredenatured by increasing the temperature or decreasing the saltconcentration of the buffer containing the nucleic acids. Under lowstringency conditions (e.g., low temperature and/or high salt) hybridduplexes (e.g., DNA:DNA, RNA:RNA, or RNA:DNA) will form even where theannealed sequences are not perfectly complementary. Thus specificity ofhybridization is reduced at lower stringency. Conversely, at higherstringency (e.g., higher temperature or lower salt) successfulhybridization requires fewer mismatches.

[0084] High stringency hybridization and washing conditions, as referredto herein, refer to conditions which permit isolation of nucleic acidmolecules having at least about 90% nucleic acid sequence identity withthe nucleic acid molecule being used to probe in the hybridizationreaction (i.e., conditions permitting about 10% or less mismatch ofnucleotides). One of skill in the art can use the formulae in Meinkothet al., 1984, Anal. Biochem. 138, 267-284 (incorporated herein byreference in its entirety) to calculate the appropriate hybridizationand wash conditions to achieve these particular levels of nucleotidemismatch. Such conditions will vary, depending on whether DNA:RNA orDNA:DNA hybrids are being formed. Calculated melting temperatures forDNA:DNA hybrids are 10° C. less than for DNA:RNA hybrids. In particularembodiments, stringent hybridization conditions for DNA:DNA hybridsinclude hybridization at an ionic strength of 6×SSC (0.9 M Na⁺) at atemperature of between about 20° C. and about 35° C., more preferably,between about 28° C. and about 40° C., and even more preferably, betweenabout 35° C. and about 45° C. In particular embodiments, stringenthybridization conditions for DNA:RNA hybrids include hybridization at anionic strength of 6×SSC (0.9 M Na⁺) at a temperature of between about30° C. and about 45° C., more preferably, between about 38° C. and about50° C., and even more preferably, between about 45° C. and about 55° C.These values are based on calculations of a melting temperature formolecules larger than about 100 nucleotides, 0% formamide and a G+Ccontent of about 40%. Alternatively, T_(m) can be calculated empiricallyas set forth in Sambrook et al., supra, pages 9.31 to 9.62.

[0085] The hybridized nucleic acids are detected by detecting one ormore labels attached to the sample nucleic acids. The labels may beincorporated by any of a number of means well known to those of skill inthe art. Detectable labels suitable for use in the present inventioninclude any composition detectable by spectroscopic, photochemical,biochemical, immunochemical, electrical, optical or chemical means.Useful labels in the present invention include biotin for staining withlabeled streptavidin conjugate, magnetic beads (e.g., Dynabeads.TM.),fluorescent dyes (e.g., fluorescein, texas red, rhodamine, greenfluorescent protein, and the like), radiolabels (e.g., ³H, ¹²⁵I, ³⁵S,¹⁴C, or ³²P), enzymes (e.g., horse radish peroxidase, alkalinephosphatase and others commonly used in an ELISA), and colorimetriclabels such as colloidal gold or colored glass or plastic (e.g.,polystyrene, polypropylene, latex, etc.) beads. Means of detecting suchlabels are well known to those of skill in the art. Thus, for example,radiolabels may be detected using photographic film or scintillationcounters, fluorescent markers may be detected using a photodetector todetect emitted light. Enzymatic labels are typically detected byproviding the enzyme with a substrate and detecting the reaction productproduced by the action of the enzyme on the substrate, and colorimetriclabels are detected by simply visualizing the colored label.

[0086] The term “quantifying” or “quantitating” when used in the contextof quantifying transcription levels of a gene can refer to absolute orto relative quantification. Absolute quantification may be accomplishedby inclusion of known concentration(s) of one or more target nucleicacids and referencing the hybridization intensity of unknowns with theknown target nucleic acids (e.g. through generation of a standardcurve). Alternatively, relative quantification can be accomplished bycomparison of hybridization signals between two or more genes, orbetween two or more treatments to quantify the changes in hybridizationintensity and, by implication, transcription level.

[0087] In one aspect of the present method, in vitro cell based assaysmay be designed to screen for compounds that affect the regulation ofgenes by a progesterone receptor at either the transcriptional ortranslational level. One, two or more promoters of the genes of thisinvention can be used to screen unknown ligands for their ability toselectively regulate transcription in vitro via PR-A or PR-B. Promotersof the selected genes can be linked to any of several reporters(including but not limited to chloramphenicol acetyl transferase, orluciferase) that measure transcriptional read-out. The promoters can betested as pure DNA, or as DNA bound to chromatin proteins. Ligands atdifferent concentrations and under different assay conditions can bescreened for their ability to either up- or down-regulate transcriptionof the selected genes, under the control of either PR-A, PR-B or both.In this embodiment, cells expressing progesterone receptors or celllysates comprising progesterone receptors are contacted with a putativeregulatory ligand for a time sufficient to act on the receptor. Thecells or cell lysates contain one, two or more promoters of the selectedgenes that are linked to any of several reporters, and the transcriptionor translation of the reporter genes is measured. Appropriate cells arepreferably prepared from any cell type that naturally expresses theprogesterone receptor or that recombinantly expresses the progesteronereceptor, thereby ensuring that the cells contain the transcriptionfactors required for transcription. The screen can be used to identifyligands that modulate the expression of the reporter construct. In suchscreens, the level of reporter gene expression is determined in thepresence of the test ligand and compared to the level of expression inthe absence of the test ligand, or the test ligand is compared to aknown ligand, such as progesterone.

[0088] In one aspect of the present method, the step of detecting caninclude detecting the expression of one or more downstream genes of theinvention in intact animals or tissues obtained from such animals.Mammalian (i.e. mouse, rat, monkey) or non-mammalian (ie. chicken)species that express PRs in their tissues and elaborate progesterone,can be the test animals. The unknown test ligand is introduced intointact or castrated animals by any of a variety of oral, intravenous,intramuscular, subdermal or other routes, for a variety of treatmenttimes or concentrations. The tissues to be surveyed can be either normalor malignant progesterone targets (including but not limited to themammary glands, mammary cancers, uterus, or endometrial cancers). Thepresence and quantity of endogenous mRNA or protein expression of one,two or more of the genes of this invention can be measured in thoseprogesterone target tissues. The gene markers can be measured in tissuesthat are fresh, frozen, fixed or otherwise preserved. They can bemeasured in cytoplasmic or nuclear organ-, tissue- or cell-extracts; orin cell membranes including but not limited to plasma, cytoplasmic,mitochondrial, golgi or nuclear membranes; in the nuclear matrix; or incellular organelles and their extracts including but not limited toribosomes, nuclei, nucleoli, mitochondria, or golgi. Assays forendogenous expression of mRNAs or proteins encoded by the genes of thisinvention can be performed as described above. Alternatively, intacttransgenic animals can be generated for ligand screening. Animals can begenetically manipulated to express the promoters of one, two or more ofthe genes of this invention linked to one or more reporters such asX-gal. After treatment of the animals with the test unknown ligands,expression of galactosidase can be measured calorimetrically in normalor malignant progesterone target organs, or tissues containing PRs, orin organs or tissues during development. Ligands that activate througheither PR-A or PR-B can be identified by their ability to regulate theappropriate selective gene promoter.

[0089] The method of the present invention includes a step of comparingthe results of detecting the expression of the one or more downstreamgenes in the presence and in the absence of the putative regulatoryligand, in order to determine whether any observed change in expressionis due to the presence of the putative regulatory compound. The step ofcomparing further includes comparing the expression of the one or moredownstream genes detected in the presence of the ligand to the manner ofexpression of the genes that is associated with the activation of theprogesterone receptor when the receptor is activated (described indetail below). As discussed above, the present inventors have identifiedthe expression profile of multiple genes that are regulated by PR,including the manner in which the genes are regulated (i.e., by which PRisoform, and in which direction by such isoform). Therefore, one candetermine whether the contact of the receptor with the putative ligandresults in a profile of gene expression that is substantially similar tothe profile of gene expression of an activated PR (i.e., agonistaction), or whether contact of the receptor with the putative ligandresults in a profile of gene expression that is an inhibition, orreversal, of the profile of gene expression of an activated PR (i.e.,antagonist action). According to the present invention, a putative testligand is determined to be a regulator of PR if the expression of thegene or genes detected after contact of the PR with the ligand isstatistically significantly altered (i.e., up or down) from theexpression detected in the profile of a PR that has been activated byprogesterone, or an equivalent agonist. The expression profiles for thegenes in Tables 1-19 were determined by evaluating PR that had beenactivated by progesterone after 6 hours.

[0090] A PR agonist is identified by detecting an expression profile inthe presence of the agonist that, at a minimum, regulates the expressionof the gene in the same direction (i.e, upregulation or downregulation)as it is regulated by an activated progesterone receptor (e.g., themanner of expression of the gene as indicated in Tables 1-7, 9-15 or 18and 19). More specifically, and by way of example, detection of theregulation of the expression of the gene in the “manner” associated withthe activation of the PR (i.e., the natural activation of the PR), at aminimum, refers to the detection of the upregulation of a gene that hasnow been shown by the present inventors to be selectively upregula3tedby PR-A (genes in Tables 1 and 9) when the receptor is in the presenceof the putative agonist, as compared to in the absence of the putativeagonist. Similarly, an agonist is identified when the expression of agene from Tables 2 or 10 is detected to be downregulated in the presenceof the putative agonist as compared to in the absence of the agonist.Such downregulation also indicates that, at a minimum, the agonistregulated the PR-A isoform. In a preferred embodiment, the agonistregulates the expression of the gene in the same direction and to atleast about 10%, and more preferably at least 20%, and more preferablyat least 25%, and more preferably at least 30%, and more preferably atleast 35%, and more preferably at least 40%, and more preferably atleast 45%, and more preferably at least 50%, and preferably at least55%, and more preferably at least 60%, and more preferably at least 65%,and more preferably at least 70%, and more preferably at least 75%, andmore preferably at least 80%, and more preferably at least 85%, and morepreferably at least 90%, and more preferably at least 95%, of the levelof expression that is induced by a progesterone receptor that has beenactivated by progesterone. In a particularly preferred embodiment, anagonist regulates the expression of the gene in the same direction andto a level of expression that is substantially equal to or greater thanthe level of expression that is induced by a progesterone receptor thathas been activated by progesterone. The level of expression isdetermined with reference to the expression of the gene in the absenceof the putative regulatory compound, or in the absence of progesterone,in the case of the control. The level of expression is then compared tothe level of expression of the control, or the level of expression thatis expected from the control.

[0091] A PR antagonist is identified by detecting an expression profilein the presence of the antagonist that, at a minimum, regulates theexpression of the gene in the opposite direction (i.e, upregulationinstead of downregulation) than the gene is regulated by an activatedprogesterone receptor (e.g., the manner of expression of the gene asindicated in Tables 1-7, 9-15 or 18 and 19), or causes a statisticallysignificant reduction in the expression level of the gene as compared tothe expression level of the gene when it is activated by progesterone,or prevents the regulation of the gene as compared to the regulation ofthe gene when the receptor is activated by progesterone. In theantagonist screening embodiments, the putative antagonists are screenedagainst a PR that is activated, and so in the absence of the putativeantagonist, the expression profile of the genes should be substantiallythe same as the expression profile set forth in Tables 1-7,9-15 and18-19). Therefore, any statistically significant decrease (inhibition)in the expression level of the gene or a reversal of the direction ofexpression of the gene in the presence of the putative antagonist ascompared to in the absence of the antagonist, indicates that theputative ligand is an antagonist. In a preferred embodiment, theantagonist inhibits the expression of the detected gene by at least 5%,and more preferably at least 10%, and more preferably at least 15%, andmore preferably at least 20%, and more preferably at least 25%, and morepreferably at least 30%, and more preferably at least 40%, and morepreferably at least 50%, and more preferably at least 60%, and morepreferably at least 70%, and more preferably at least 80%, and morepreferably at least 90%, as compared to the level of expression that isinduced by the activated progesterone receptor in the absence of theputative antagonist. In one embodiment, an antagonist regulates theexpression of the gene in the opposite direction (i.e., reverses theexpression) as compared to the expression of the gene induced by theactivated progesterone receptor in the absence of the putativeantagonist.

[0092] It will be appreciated by those of skill in the art thatdifferences between the expression of genes regulated by the putativeligand (via the PR) and the expression of genes regulated by the naturalligand (via the PR) may be small or large. Some small differences may bevery reproducible and therefore the ligand identified by the method canbe useful. For other purposes, large differences may be desirable forease of detection of the regulatory activity. It will be thereforeappreciated that the exact boundary between what is called an agonistand what is called an antagonist can shift, depending on the goal of thescreening assay. For some assays it may be useful to set thresholdlevels of change. For other purposes the putative antagonist ligand maysimply have a lower level of activity than an agonist ligand (e.g. atest ligand having 10% of the activity of an agonist can be anantagonist of that agonist). This may depend on the technique being usedfor detection as well as on the number of genes which are being tested.One of skill in the art can readily determine the criteria for selectionof suitable antagonists.

[0093] Given the knowledge of the gene expression profiles of thepresent invention as set forth in Tables 1-7, 9-15 and 18-19, one ofskill in the art can, for the first time, identify isoform-specificregulators of progesterone receptors. Therefore, one embodiment of thepresent invention relates to a method to identify isoform-specificagonists of progesterone receptors. This method includes the steps of:(a) contacting a progesterone receptor with a putative agonist ligand,wherein the progesterone receptor is selected from the group consistingof progesterone receptor A (PR-A) and progesterone receptor B (PR-B),under conditions wherein in the absence of the putative agonist ligand,the progesterone receptor is not activated; (b) detecting expression ofat least one gene that is selectively regulated by the progesteronereceptor when the progesterone receptor is activated, and (c) comparingthe expression of the at least one gene in the presence and in theabsence of the putative agonist ligand. In this embodiment, the at leastone gene is selected from the group consisting of: (i) at least one genethat is exclusively upregulated or downregulated by PR-A, chosen from aTable selected from the group consisting of Table 1 and Table 2; and,(ii) at least one gene that is exclusively upregulated or downregulatedby PR-B chosen from a Table selected from the group consisting of Table3 and Table 4. Detection of regulation of the expression of the at leastone gene in the manner associated with activation of the progesteronereceptor as set forth in (i) but not (ii), indicates that the putativeagonist ligand is a PR-A-specific agonist, and wherein detection ofregulation of the expression of the at least one gene in the mannerassociated with activation of the progesterone receptor as set forth in(ii) but not (i), indicates that the putative agonist ligand is aPR-B-specific agonist.

[0094] Another embodiment of the present invention relates to a methodto identify isoform-specific antagonists of progesterone receptors,comprising: (a) contacting a progesterone receptor with a putativeantagonist ligand, wherein the progesterone receptor is selected fromthe group consisting of progesterone receptor A (PR-A) and progesteronereceptor B (PR-B), under conditions wherein, in the absence of theputative antagonist ligand, the progesterone receptor is activated; (b)detecting expression of at least one gene that is regulated by theprogesterone receptor when the progesterone receptor is activated; and(c) comparing the expression of the at least one gene in the presenceand in the absence of the putative antagonist ligand. In thisembodiment, the at least one gene is selected from the group consistingof: (i) at least one gene that is exclusively upregulated ordownregulated by PR-A, chosen from a Table selected from the groupconsisting of Table 1 and Table 2; and, (ii) at least one gene that isexclusively upregulated or downregulated by PR-B chosen from a Tableselected from the group consisting of Table 3 and Table 4. In thepresence of the putative antagonist ligand, detection of inhibition orreversal of the regulation of expression of the at least one gene ascompared to the regulation of expression of the at least one gene in themanner associated with activation of the progesterone receptor as setforth in (i) but not (ii), indicates that the putative antagonist ligandis a PR-A-specific antagonist, and wherein, in the presence of theputative antagonist ligand, detection of inhibition or reversal of theregulation of expression of the at least one gene as compared to theregulation of the expression of the at least one gene in the mannerassociated with activation of the progesterone receptor as set forth in(ii) but not (i), indicates that the putative antagonist ligand is aPR-B-specific antagonist.

[0095] Given the knowledge of the genes regulated exclusively byprogesterone receptor isoforms according to the present invention, oneof skill in the art will be able to select one or more genes to detectin a method of the present invention, and the selection of the one ormore genes can be determined by different factors. For example, one ofskill in the art may wish to further select genes to be detected on thebasis of the function of the gene or gene product, on the basis oftissue-type in which a PR is expressed, on the basis of association witha particular condition or disease, or on the basis of the change in thelevel of expression of the gene when in the presence of progesterone.Such embodiments have generally been described above.

[0096] Antiprogestins that selectively inhibit progestin effects on onlyone of the two PRs, would be highly desirable, but do not exist atpresent. Such antagonist ligands would be useful not only for breastcancer treatment, but to treat a variety of reproductive disorders, andfor contraception. Antagonists that can inhibit only PR-A withoutaffecting PR-B (and vice-versa) would be highly desirable. The currentinvention allows for rapid and direct screening for such ligands. Forexample, the invention identifies clusters of genes that are upregulatedonly by PR-A or PR-B in the presence of the agonist, progesterone. Thesegene clusters are perfect targets for antiprogestin (antagonist) andprogestin (agonist) screening by the cell-free in vitro, intact cell invitro, or whole animal endogenous or transgenic methods described above.For the embodiment related to antagonists, a selected cluster of one,two or more of the genes of this invention that are exclusivelyregulated by PR-A or PR-B would first be activated by progesterone oranother progestin. Putative antiprogestins would be screened andselected on the basis of their ability to reverse or inhibit the effectsof the agonist, progesterone, by comparing the expression profiles ofthe genes in the presence of the putative antiprogestin to theexpression profile of the genes as a result of activation of thereceptor with a progestin. Isoform-specific agonists of PRs can besimilarly selected by choosing ligands on the basis of their ability tomimic the effects of the agonist, progesterone, on the PR isoforms.

[0097] These two embodiments of the present invention take advantage ofthe knowledge provided herein of the isoform-specific regulation ofgenes by progesterone receptors. Prior to the present invention, suchassays were impossible, because the specific regulation of a gene by onePR isoform, and not the other, was not known. By way of example, if agene in Table 1 is detected (i.e., a gene that is known to beupregulated selectively (i.e., exclusively, uniquely) by PR-A) when thePR to be tested (at least PR-A or a combination of PR-A and PR-B) is inthe presence of a putative regulatory ligand, and the expression of thatgene is determined to be in the manner associated with activation of theprogesterone receptor (i.e., the gene is upregulated), then it can beconcluded that the putative regulatory compound is a PR-A-specificagonist, because the present inventors have shown that the gene isexclusively upregulated by PR-A. Similarly, if a gene in Table 4 isdetected (i.e., a gene that is known to be downregulated selectively(i.e., exclusively, uniquely) by PR-B) when the PR to be tested (atleast PR-B or a combination of PR-A and PR-B) is in the presence of aputative regulatory ligand, and the expression of that gene isdetermined to be is in the manner associated with activation of theprogesterone receptor (i.e., the gene is downregulated), then it can beconcluded that the putative regulatory compound is a PR-B-specificagonist, because the present inventors have shown that this particulargene is exclusively downregulated by PR-B. For a putative antagonist, ifthe same gene in Table 4 is detected when the PR to be tested is or willbe activated and is in the presence of the putative antagonist, and theexpression of that gene is determined to be inhibited or reversed (i.e.,the gene is upregulated or is statistically significantly lessdownregulated) as compared to the expression of the gene in the mannerassociated with activation of the progesterone receptor, then it can beconcluded that the putative regulatory compound is a PR-B-specificantagonist, because the present inventors have shown that thisparticular gene is exclusively downregulated by PR-B.

[0098] The particular details relating to the contacting, detecting andcomparing steps of the above-described methods for identification of PRisoform-specific ligands are substantially the same as those describedabove for the broader methods of identifying PR regulatory ligands andwill not be repeated here.

[0099] Agonists and antagonists of progesterone receptors identified bythe above methods or any other suitable method are useful in a varietyof therapeutic methods as described herein.

[0100] Yet another embodiment of the present invention relates to amethod to identify a tissue-specific agonist of a progesterone receptor.This method includes the steps of: (a) providing an expression profilefor at least one gene that is known to be regulated by a progesteronereceptor in both a first and second tissue type when the progesteronereceptor is activated, wherein the at least one gene is chosen from thegenes in any one or more of the genes in Tables 1-7; (b) contacting aprogesterone receptor expressed by a first tissue type with a putativeagonist ligand, wherein the progesterone receptor is selected from thegroup consisting of progesterone receptor A (PR-A) and progesteronereceptor B (PR-B), under conditions wherein, in the absence of theputative agonist ligand, the progesterone receptor is not activated; (c)contacting a progesterone receptor expressed by a second tissue typewith the putative agonist ligand under conditions wherein, in theabsence of the putative agonist ligand, the progesterone receptor is notactivated, wherein the progesterone receptor is the same isoform as theprogesterone receptor contacted in (b); (d) detecting expression of theat least one gene from (a); and, (e) comparing the expression of the atleast one gene in the presence and in the absence of the putativeagonist ligand in each of the first and second tissue types. Detectionof regulation of the expression of the at least one gene in one of thefirst or second tissue types in the manner associated with activation ofthe progesterone receptor as set forth in the expression profile of (a),and detection of inhibition of regulation or no regulation of the atleast one gene in the other of the first or second tissue types, ascompared to the expression of the at least one gene associated withactivation of the progesterone receptor as set forth in the expressionprofile of (a), indicates that the putative agonist ligand is atissue-specific progesterone receptor agonist.

[0101] Similarly, another embodiment of the present invention relates toa method to identify a tissue-specific agonist of a progesteronereceptor, such method comprising: (a) providing an expression profilefor at least one gene that is known to be regulated by a progesteronereceptor in a first tissue type but not a second tissue type when theprogesterone receptor is activated, wherein the at least one gene ischosen from the genes in any one or more of the genes in Tables 1-7; (b)contacting a progesterone receptor expressed by the first tissue typewith a putative agonist ligand, wherein the progesterone receptor isselected from the group consisting of progesterone receptor A (PR-A) andprogesterone receptor B (PR-B), under conditions wherein, in the absenceof the putative agonist ligand, the progesterone receptor is notactivated; (c) detecting expression of the at least one gene from (a);(d) comparing the expression of the at least one gene in the presenceand in the absence of the putative agonist ligand in the first tissuetype, wherein detection of regulation of the expression of the at leastone gene in the first tissue type in the manner associated withactivation of the progesterone receptor as set forth in the expressionprofile of (a) indicates that the putative agonist ligand is atissue-specific progesterone receptor agonist for the first tissue type.In this embodiment, it is desirable to include additional controls orthe detection of multiple genes that confirm that the regulation of thePR by the putative regulatory ligand is tissue-specific.

[0102] Another embodiment of the present invention relates to a methodto identify a tissue-specific antagonist of a progesterone receptor.This method includes the steps of: (a) providing an expression profilefor at least one gene that is known to be regulated by a progesteronereceptor in both a first and second tissue type when the progesteronereceptor is activated, wherein the at least one gene is chosen from thegenes in any one or more of the genes in Tables 1-7; (b) contacting aprogesterone receptor expressed by a first tissue type with a putativeantagonist ligand, wherein the progesterone receptor is selected fromthe group consisting of progesterone receptor A (PR-A) and progesteronereceptor B (PR-B), under conditions wherein, in the absence of theputative antagonist ligand, the progesterone receptor is activated; (c)contacting a progesterone receptor expressed by a second tissue typewith the putative antagonist ligand, wherein the progesterone receptoris selected from the group consisting of progesterone receptor A (PR-A)and progesterone receptor B (PR-B), under conditions wherein, in theabsence of the putative antagonist ligand, the progesterone receptor isactivated; (d) detecting expression of the at least one gene from (a);and, (e) comparing the expression of the at least one gene in thepresence and in the absence of the putative antagonist ligand in each ofthe first and second tissue types, wherein detection of regulation ofthe expression of the at least one gene in one of the first or secondtissue types in the manner associated with activation of theprogesterone receptor as set forth in the expression profile of (a) inthe presence of the putative antagonist ligand, and detection ofinhibition or reversal of regulation of expression of the at least onegene in the other of the first or second tissue types in the presence ofthe putative antagonist ligand, as compared to the expression of the atleast one gene associated with activation of the progesterone receptoras set forth in the expression profile of (a), indicates that theputative antagonist ligand is a tissue-specific progesterone receptorantagonist.

[0103] Similarly, another embodiment of the present invention relates toa method to identify a tissue-specific antagonist of a progesteronereceptor, such method including the steps of: (a) providing anexpression profile for at least one gene that is known to be regulatedby a progesterone receptor in a first but not in a second tissue typewhen the progesterone receptor is activated, wherein the at least onegene is chosen from the genes in any one or more of the genes in Tables1-7; (b) contacting a progesterone receptor expressed by a first tissuetype with a putative antagonist ligand, wherein the progesteronereceptor is selected from the group consisting of progesterone receptorA (PR-A) and progesterone receptor B (PR-B), under conditions wherein,in the absence of the putative antagonist ligand, the progesteronereceptor is activated; (c) detecting expression of the at least one genefrom (a); and, (d) comparing the expression of the at least one gene inthe presence and in the absence of the putative antagonist ligand in thefirst tissue type, wherein detection of inhibition or reversal ofregulation of expression of the at least one gene in the first tissuetype in the presence of the putative antagonist ligand, as compared tothe expression of the at least one gene associated with activation ofthe progesterone receptor as set forth in the expression profile of (a),indicates that the putative antagonist ligand is a tissue-specificprogesterone receptor antagonist of the first tissue type. In thisembodiment, it is desirable to include additional controls or thedetection of multiple genes that confirm that the regulation of the PRby the putative regulatory ligand is tissue-specific.

[0104] In one aspect of any of the above-described embodiments foridentifying a tissue-specific regulator of PR activity, the first tissuetype is breast, and at least one gene is selected from the groupconsisting of any one or more of the genes in Tables 1-7. In general,the first or second tissue type can be any tissue type, including anycell type, that expresses a progesterone receptor. For example, tissuesthat are known to express progesterone receptors include, but are notlimited to, breast, uterus, bone, cartilage, cardiovascular tissues,heart, lung, brain, meninges, pituitary, ovary, oocyte, corpus luteum,oviduct, fallopian tubes, T lymphocytes, B lymphocytes, thymocytes,salivary gland, placenta, skin, gut, pancreas, liver, testis,epididymis, bladder, urinary tract, eye, and teeth.

[0105] In another aspect, the first tissue type is a non-malignanttissue and wherein the second tissue type is a malignant tissue from thesame tissue source as the first tissue type. A preferred tissue sourcefor screening for regulators of malignant tissue but not non-malignanttissue is breast tissue.

[0106] In another aspect, the first tissue type is a normal tissue andwherein the second tissue type is a non-malignant, abnormal tissue. Suchtissues include, but are not limited to, tissues from endometriosis andleiomyoma of the uterus, fibrocystic disease of the breast, orpolycystic ovary.

[0107] In one aspect of the tissue-specific methods of the presentinvention, the method includes the detection of the any one or more ofthe following genes: 11-beta-hydroxysteroid dehydrogenase type 2, tissuefactor gene, PCI gene (plasminogen activator inhibitor 3), MAD-3Ikβ-alpha, Niemann-Pick C disease (NPC1), platelet-typephosphofructokinase, phenylethanolamine n-methyltransferase (PNMT),transforming growth factor-beta 3 (TGF-beta3), Monocyte ChemotacticProtein 1, delta sleep inducing peptide (related to TSC-22), estrogenreceptor-related protein (hERRa1). These genes are of particularinterest when one of the tissue types is the endometrium.

[0108] In another aspect of the tissue-specific methods of the presentinvention, the method includes the detection of the any one or more ofthe following genes: growth arrest-specific protein (gas6), tissuefactor gene, NF-IL6-beta (C/EBPbeta), PCI gene (plasminogen activatorinhibitor), Stat5A, calcium-binding protein S100P, MSX-2, lipocortin II(calpactin I), selenium-binding protein (hSBP), and bullous pemphigoidantigen (plakin family). These genes are of particular interest when oneof the tissue types is the breast.

[0109] In another aspect of the tissue-specific methods of the presentinvention, the method includes the detection of phenylethanolaminen-methyltransferase (PNMT) adrenal medulla. This gene is of particularinterest when one of the tissue types is brain tissue.

[0110] In another aspect of the tissue-specific methods of the presentinvention, the method includes the detection of proteasome-like subunitMECL-1. This gene is of particular interest when one of the tissue typesis thymus tissue.

[0111] In yet another aspect of these methods, the expression profile ofgenes regulated by a progesterone receptor in the first or second tissuetype is provided by a method comprising:

[0112] (a) providing a first cell of a selected tissue type thatexpresses a progesterone receptor A (PR-A) and not a progesteronereceptor B (PR-B) and a second cell of the same tissue type thatexpresses PR-B and not PR-A; (b)stimulating the progesterone receptorsin (a) by contacting the first and second cells with a progesteronereceptor stimulatory ligand; (c) detecting expression of genes by thefirst and second cells in the presence of the stimulatory ligand and inthe absence of the stimulatory ligand, wherein a difference in theexpression of a gene in the presence of the stimulatory ligand ascompared to in the absence of the stimulatory ligand, indicates that thegene is regulated by the progesterone receptor in the selected tissuetype.

[0113] The present invention defines genes that are regulated by PR-Avs. PR-B in breast cancer cells. It is believed that many, if not mostof these genes, will also be regulated by progesterone receptors inother tissues. Similar data can be generated for other tissues,including the uterus, bone, cardiovascular tissues, etc., or malignantvs. normal tissues. Progestin regulated genes in other tissues, whichdiffer from the genes in breast cancer cells of this invention, can beidentified, and be used to screen for ligands that regulate candidategenes only in the desired tissue. For example, using the appropriategene clusters, one could identify a ligand that activates PR-A in theuterus but not the breast. Similarly one could screen out ligands thathave undesirable organ or tissue effects. For example, ligands that areinadvertently bioactive in the liver, where they might induce livertoxicity, could be discarded. Alternatively, when a gene is regulated inboth tissue types, one can screen for ligands that regulate theexpression of the gene in one tissue type, but not the other tissuetype. For example, by using tissue specific methods described above, itis also possible to screen for antagonists that block the actions ofprogestins in one organ or tissue and through one PR isoform, but notanother organ or tissue and the other PR isoform. For example, if PR-Aare “good” receptors in the uterus but not the breast, a selective“antiprogestin-A” might be found that is only inhibitory in the breast.

[0114] Given the guidance provided herein, it is within the ability ofthose of skill in the art to screen other tissue types for the presenceor absence of the genes regulated by PR in breast tissue, and/or toperform a de novo screening assay for the identification of genesregulated by PR in another tissue, to develop gene expression profilesfor use in screening for tissue specific ligands. One of skill in theart can now look to see if a given gene that is regulated by PR inbreast is also regulated by PR in another tissue type, thereby providinga gene profile for use in the tissue-specific ligand identificationmethods of the present invention.

[0115] The particular details relating to the contacting, detecting andcomparing steps of the above-described methods for identification of PRisoform-specific ligands are substantially the same as those describedabove for the broader methods of identifying PR regulatory ligands andwill not be repeated here.

[0116] Another method of the present invention relates to a method toidentify genes that are regulated by a progesterone receptor in two ormore tissue types. The method includes the steps of: (a) activating aprogesterone receptor in two or more tissue types that express theprogesterone receptor; (b) detecting expression of at least one gene inthe two or more tissue types, the at least one gene being chosen from agene in any one or more of Tables 1-7, and, (c) identifying genes thatare regulated by the progesterone receptor in each of the two or moretissue types. In one embodiment, the method further includes detectingwhether the genes are regulated selectively by PR-A, selectively byPR-B, or by both PR-A and PR-B. This method can generally be used toprovide a profile of genes in a tissue type other than breast. Such aprofile can then be used in a method for the identification oftissue-specific progesterone receptor ligands as described above, or ina method of determining a profile of genes for a given tissue sample asdescribed below.

[0117] Yet another embodiment of the present invention relates to amethod to determine the profile of genes regulated by progesteronereceptors in a tissue sample. In a preferred embodiment, the sample is abreast tumor sample. This method includes the steps of: (a) obtainingfrom a patient a breast tumor sample; (b) detecting expression of atleast one gene in the breast tumor sample that is regulated by aprogesterone receptor when the progesterone receptor is activated; and,(c) producing a profile of genes for the tumor sample that are regulatedselectively by PR-A, selectively by PR-B, or by both PR-A and PR-B. Inthis embodiment, the gene(s) to be profiled are being selected from thegroup consisting of: (i) at least one gene that is selectivelyupregulated by PR-A chosen from a gene in Table 9; (ii) at least onegene that is selectively downregulated by PR-A chosen from a gene inTable 10; (iii) at least one gene that is selectively upregulated byPR-B chosen from a gene in Table 11; (iv) at least one gene that isselectively downregulated by PR-B chosen from a gene in Table 12; (v) atleast one gene that is upregulated or downregulated by both PR-A andPR-B chosen from a gene in Table 13; (vi) at least one gene that isreciprocally regulated by PR-A and PR-B chosen from a gene in Table 14;and, (vii) at least one gene that is regulated by one of the PR-A or thePR-B, wherein regulation of the gene is altered when the other of thePR-A or PR-B is expressed by the same cell, chosen from a gene in Table15.

[0118] Because of their physiological importance in the breast, PRs areroutinely measured in all breast cancers when the disease is firstdiagnosed. Presence of PRs, especially if the levels are high, informsthe oncologist that the tumor is likely to be “hormone-dependent” andwill respond to endocrine treatments. This spares the woman from muchharsher treatments involving chemotherapies. Additionally, the number ofPRs allows the oncologist to predict how aggressive the tumor is likelyto be. High PR levels in her tumor indicates that a woman's prognosis isgood. Thus measurement of total PRs levels plays a key role in themanagement of breast cancers.

[0119] Both PR-A and PR-B are present in PR-positive breast cancers. ThePR-A:PR-B ratio varies widely from tumor to tumor, and some tumorsexpress only one or the other isoform. However, the clinicalconsequences of this heterogeneity are unknown. Because thetranscriptional effects of the two PRs are believed to be so different,fluctuations in their ratio are expected to critically influence thebiology of the tumors. However, at present, how that biology is affectedis unknown. Whether in fact, PR-A are “bad” and PR-B are “good” inbreast cancers, is also unknown. Since most breast cancer cell lineslose their PRs, and both isoforms are co-expressed in cell lines thatretain their PRs, one way to determine the biological consequences ofvarying A:B ratios is to define the endogenous genes that each of thetwo PRs regulates independently. Knowledge of the unique sets of genesthat are selectively regulated by each PR isoform as disclosed hereinallows the genes to serve as surrogate markers for the presence andfunction of PR-A vs. PR-B. Furthermore, knowledge of such genes andtheir promoters, allows the genes to serve as a tool for screening PR-Avs. PR-B selective ligands. However, prior to the present invention,defining which sets of genes were uniquely regulated by one or the otherPR in breast cancers was impossible because both receptors aresimultaneously activated by progesterone treatment. The presentinvention has provided a solution to this problem.

[0120] As discussed above, total PRs are routinely measured in allprimary breast cancers as a guide to therapy. Their presence and levelsare used to predict whether the tumor is likely to respond to hormonetreatments, and to estimate disease prognosis. Tumors that lack PRs haveless than 10% chance of responding to hormone treatments; tumors thatcontain PRs have on average a 70% chance of responding to hormonetreatments depending on the receptor levels. These numbers arestatistical only, and therefore are not specifically informative for anyindividual patient. The present invention has led to the development ofassays that profile the tumor of an individual patient for “good” and“bad” surrogate markers of PR-A and PR-B. Thus it is now possible tomeasure not only the presence of PRs in a tumor, but the function of thePRs in that tumor.

[0121] In this embodiment, one or more of the genes set forth in Tables9-15 are selected to be screened in a tissue sample from a patient.Preferably, the tissue sample is a breast tumor sample. The expressionof the genes in the tissue sample can be detected using techniquesdescribed above for the various other methods of the present invention.For example, transcript expression levels of the selected genes can bemeasured in the tumor of a patient, by any of a number of known methods.For mRNA expression, methods include but are not limited to: northernblotting; reverse transcriptase-polymerase chain reaction and detectionof the product; use of labeled mRNA from the tumor to probe cDNAs oroligonucleotides encoding all or part of the PR-responsive genes ofinterest, arrayed on any of a variety of surfaces, as described above.For detection of protein expression levels of the selected genes,methods include but are not limited to: western blotting,immunocytochemistry, flow cytometry or other immunologic-based assays;assays based on a property of the protein including but not limited toDNA binding, ligand binding, or interaction with other protein partners,as described above. The presence and quantity of each gene marker can bemeasured in primary tumors, metastatic tumors, locally recurring tumors,ductal carcinomas in situ, or other tumors of breast cell origin. Themarkers can be measured in solid tumors that are fresh, frozen, fixed orotherwise preserved. They can be measured in cytoplasmic or nucleartumor extracts; or in tumor membranes including but not limited toplasma, mitochondrial, golgi or nuclear membranes; in the nuclearmatrix; or in tumor cell organelles and their extracts including but notlimited to ribosomes, nuclei, mitochondria, golgi.

[0122] A profile of individual gene markers, including a matrix of twoor more markers, can be generated by one or more of the methodsdescribed above. According to the present invention, a profile of thegenes regulated by progesterone receptors in a tissue sample refers to areporting of the expression level of a given gene from Tables 9-15,wherein, based on the knowledge of the regulation of the genes providedby Tables 9-15, includes a classification of the gene with regard to howthe gene is regulated by the PR isoforms. For example, if the gene,estrogen receptor-related protein, is identified as being expressed by atumor sample, the profile for the tumor will include the reporting ofthe expression of at least one gene that is exclusively regulated byPR-A. The data can be reported as raw data, and/or statisticallyanalyzed by any of a variety of methods, and/or combined with any otherprognostic marker(s) including but not limited to ER, % S-phase, otherproliferation markers, markers of ER expression, tumor suppressor genes,etc. Prior to the present invention, one of skill in the art would nothave known to screen breast tumors for the genes in Tables 1-7, 9-10 or18-19, (excepting genes in Table 16), and one of skill in the art wouldnot have been able to classify any of these genes on the basis of the PRisoform regulation.

[0123] Given the knowledge of the genes regulated by progesteronereceptor isoforms according to the present invention, one of skill inthe art will be able to select one or more genes to detect in thismethod of the present invention, and the selection of the one or moregenes can be determined by different factors. For example, one of skillin the art may wish to further select genes to be detected on the basisof the function of the gene or gene product, on the basis of PRisoform-specificity, on the basis of association with a particularcondition or disease, or on the basis of the change in the level ofexpression of the gene when in the presence of progesterone. Suchembodiments have generally been described above.

[0124] In one aspect of this method of the present invention, the methodpreferably includes the detection of the any one or more of thefollowing genes: growth arrest-specific protein (gas6), tissue factorgene, NF-IL6-beta (C/EBPbeta), PCI gene (plasminogen activatorinhibitor), Stat5A, calcium-binding protein S100P, MSX-2, lipocortin 11(calpactin I), selenium-binding protein (hSBP), and bullous pemphigoidantigen (plakin family). These genes are of particular interest when oneof the tissue types is the breast.

[0125] In another aspect of this method of the present invention, themethod preferably includes the detection of the any one of more of thefollowing genes: growth arrest-specific protein (gas6), NF-IL6-beta(C/EBPbeta), calcium-binding protein S100P, MSX-2, selenium-bindingprotein (hSBP), and bullous pemphigoid antigen (plakin family).

[0126] The profile of genes provided as a result of the screening of thetissue can be used by the patient or physician for decision-makingregarding the usefulness of endocrine therapies in general (i.e.oophorectomy, antiestrogens or other SERMs, aromatase inhibitors, orothers), or progestational therapy in particular (high dose progestins,antiprogestins or others). The profile can be used to estimate how thedisease is likely to respond and progress in any individual patient.Clinical trials can be developed to correlate the relationship betweenPR-A vs. PR-B regulated genes, and the biological behavior of the tumor.

[0127] In addition, if it is determined that one PR isoform is harmful,and the other beneficial, the gene clusters of this invention can bemeasured or quantified in normal breast or other normal tissues, eitherfrozen or preserved, or in tissue or organelle extracts as describedabove, either alone or together with other markers (for example BRCA1),and used for genetic counseling.

[0128] In addition, one of the key questions that the present inventioncan address, is whether breast tumors that overexpress PR-B or PR-Arepresent phenotypically different tumor subsets. For example, breasttumors that are identified as “PR-B rich” based on their expression ofPR-B specific genes, can be further assessed in terms of usual clinicalparameters—tumor staging, pathological staging, size, nodal status,metastasis, responsiveness to hormonal and chemotherapies—and comparedto parallel tumors that are “PR-A rich”. Without being bound by theory,the present inventors predict that PR-B rich tumors may be larger andmore aggressive than PR-A rich tumors. One reason for this is that thisinvention demonstrates that PR-B strongly and uniquely upregulate twoimportant genes that support angiogenesis: L13720, growtharrest-specific protein (gas 6) is increased 23.1 fold; M27436, tissuefactor gene is increased 18.1 fold. Increased angiogenesis, byincreasing their blood (and nutrient) supply, promotes tumor growth.This is one example of the hypotheses that can be raised and tested,based on the new information revealed by this invention.

[0129] In one aspect of this embodiment of the invention, the profilingof genes can be extended to other tissue types and/or other genes. Forexample, as discussed above, using the guidance provided herein, it iswithin the ability of those of skill in the art to screen other tissuetypes for the presence or absence of the genes regulated by PR in breasttissue, and/or to perform a de novo screening assay for theidentification of genes regulated by PR in another tissue, to developgene expression profiles for use in screening for tissue specificligands. One of skill in the art can now look to see if a given genethat is regulated by PR in breast is also regulated by PR in anothertissue type. Moreover, the 4 breast cancer cell lines described inExample 1, can be used to screen other gene arrays, including arrays ofexpressed tag sequences, to discover additional novel, PR-A vs. PR-Bregulated genes. The procedure used to produce these cells can beextended to cells from other tissue sources (e.g., the uterus), and newPR-A and PR-B regulated genes can be identified for these tissuesources. Additional applications of the present invention includescreening for genes that are regulated by PRs in a ligand-independentmanner. The extension of the gene profiles to other tissue types willallow for the development of a variety of diagnostic assays in othertissues and for diseases related to such other tissues, as well as theidentification of additional targets for therapeutic strategies.

[0130] Another embodiment of the present invention relates to aplurality of polynucleotides for the detection of the expression ofgenes regulated by progesterone receptors in breast tissue. Theplurality of polynucleotides consists of polynucleotide probes that arecomplementary to RNA transcripts, or nucleotides derived therefrom, ofgenes that are regulated by progesterone receptors, and is thereforedistinguished from previously known nucleic acid arrays and primer sets.The plurality of polynucleotides within the above-limitation includes atleast one or more, but is not limited to one or more, polynucleotideprobes that are complementary to RNA transcripts, or nucleotides derivedtherefrom, of genes identified by the present inventors. Such genes areselected from: (a) at least one gene that is selectively upregulated byPR-A chosen from a gene in Table 1; (b) at least one gene that isselectively downregulated by PR-A chosen from a gene in Table 2; (c) atleast one gene that is selectively upregulated by PR-B chosen from agene in Table 3; (d) at least one gene that is selectively downregulatedby PR-B chosen from a gene in Table 4; (e) at least one gene that isupregulated or downregulated by both PR-A and PR-B chosen from a gene inTable 5; (f) at least one gene that is reciprocally regulated by PR-Aand PR-B chosen from a gene in Table 6; and, (g) at least one gene thatis regulated by one of the PR-A or the PR-B, wherein regulation of thegene is altered when the other of the PR-A or PR-B is expressed by thesame cell, chosen from a gene in Table 7.

[0131] In one embodiment, it is contemplated that additional genes thatare not regulated by progesterone receptors can be added to theplurality of polynucleotides. Such genes would not be random genes, orlarge groups of unselected human genes, as are commercially availablenow, but rather, would be specifically selected to complement the setsof progesterone receptor-regulated genes identified by the presentinvention. For example, one of skill in the art may wish to add to theabove-described plurality of genes one or more genes that are ofrelevance because they are expressed by a particular tissue of interest(e.g., breast tissue), are associated with a particular disease orcondition of interest (e.g., breast cancer), or are associated with aparticular cell, tissue or body function (e.g., angiogenesis). Thedevelopment of additional pluralities of polynucleotides (andantibodies, as disclosed below), which include both the above-describedplurality and such additional selected polynucleotides, are explicitlycontemplated by the present invention.

[0132] In one embodiment, the plurality of polynucleotides furthercomprises at least one polynucleotide probe that is complementary to RNAtranscripts, or nucleotides derived therefrom, of at least one genechosen from the genes in Table 8. In another embodiment, the pluralityof polynucleotides comprises polynucleotide probes that arecomplementary to RNA transcripts, or nucleotides derived therefrom, ofparticular subsets of the genes disclosed in the present invention. Forexample, one of skill in the art may wish to design pluralities ofpolynucleotides on the basis of the function of the gene or geneproduct, on the basis of a tissue-type that expresses a PR, on the basisof PR isoform specificity, on the basis of association with a particularcondition or disease, or on the basis of the change in the level ofexpression of the gene when in the presence of progesterone. Suchembodiments have generally been described above.

[0133] According to the present invention, a plurality ofpolynucleotides refers to at least 2, and more preferably at least 3,and more preferably at least 4, and more preferably at least 5, and morepreferably at least 6, and more preferably at least 7, and morepreferably at least 8, and more preferably at least 9, and morepreferably at least 10, and so on, in increments of one, up to anysuitable number of polynucleotides, including at least 100, 500, 1000,10⁴, 10⁵, or at least 10⁶ or more polynucleotides.

[0134] In accordance with the present invention, an isolatedpolynucleotide, or an isolated nucleic acid molecule, is a nucleic acidmolecule that has been removed from its natural milieu (i.e., that hasbeen subject to human manipulation), its natural milieu being the genomeor chromosome in which the nucleic acid molecule is found in nature. Assuch, “isolated” does not necessarily reflect the extent to which thenucleic acid molecule has been purified, but indicates that the moleculedoes not include an entire genome or an entire chromosome in which thenucleic acid molecule is found in nature. The polynucleotides useful inthe plurality of polynucleotides of the present invention are typicallya portion of a gene of the present invention that is suitable for use asa hybridization probe or PCR primer for the identification of afull-length gene (or portion thereof) in a given sample (e.g., a cellsample). An isolated nucleic acid molecules can include a gene or aportion of a gene (e.g., the regulatory region or promoter), forexample, to produce a reporter construct according to the presentinvention. An isolated nucleic acid molecule that includes a gene is nota fragment of a chromosome that includes such gene, but rather includesthe coding region and regulatory regions associated with the gene, butno additional genes naturally found on the same chromosome. An isolatednucleic acid molecule can also include a specified nucleic acid sequenceflanked by (i.e., at the 5′ and/or the 3′ end of the sequence)additional nucleic acids that do not normally flank the specifiednucleic acid sequence in nature (i.e., heterologous sequences). Isolatednucleic acid molecule can include DNA, RNA (e.g., mRNA), or derivativesof either DNA or RNA (e.g., cDNA). Although the phrase “nucleic acidmolecule” primarily refers to the physical nucleic acid molecule and thephrase “nucleic acid sequence” primarily refers to the sequence ofnucleotides on the nucleic acid molecule, the two phrases can be usedinterchangeably, especially with respect to a nucleic acid molecule, ora nucleic acid sequence, being capable of encoding a protein.Preferably, an isolated nucleic acid molecule of the present inventionis produced using recombinant DNA technology (e.g., polymerase chainreaction (PCR) amplification, cloning) or chemical synthesis. If thepolynucleotide is an oligonucleotide probe, the probe preferably rangesfrom about 5 to about 50 or about 500 nucleotides, more preferably fromabout 10 to about 40 nucleotides, and most preferably from about 15 toabout 40 nucleotides in length.

[0135] In one embodiment, the polynucleotide probes are conjugated todetectable markers. Detectable labels suitable for use in the presentinvention include any composition detectable by spectroscopic,photochemical, biochemical, immunochemical, electrical, optical orchemical means. Useful labels in the present invention include biotinfor staining with labeled streptavidin conjugate, magnetic beads (e.g.,Dynabeads.TM.), fluorescent dyes (e.g., fluorescein, texas red,rhodamine, green fluorescent protein, and the like), radiolabels (e.g.,³H, ¹²⁵I, ³⁵S, ¹⁴C, or ³²P), enzymes (e.g., horse radish peroxidase,alkaline phosphatase and others commonly used in an ELISA), andcolorimetric labels such as colloidal gold or colored glass or plastic(e.g., polystyrene, polypropylene, latex, etc.) beads. Preferably, thepolynucleotide probes are immobilized on a substrate.

[0136] In one embodiment, the polynucleotide probes are hybridizablearray elements in a microarray or high density array. Nucleic acidarrays are well known in the art and are described for use in comparingexpression levels of particular genes of interest, for example, in U.S.Pat. No. 6,177,248, which is incorporated herein by reference in itsentirety. Nucleic acid arrays are suitable for quantifying a smallvariations in expression levels of a gene in the presence of a largepopulation of heterogeneous nucleic acids. Knowing the identity of thedownstream genes of the present invention, nucleic acid arrays can befabricated either by de novo synthesis on a substrate or by spotting ortransporting nucleic acid sequences onto specific locations ofsubstrate. Nucleic acids are purified and/or isolated from biologicalmaterials, such as a bacterial plasmid containing a cloned segment ofsequence of interest. It is noted that all of the genes identified bythe present invention have been previously sequenced, at least in part,such that oligonucleotides suitable for the identification of suchnucleic acids can be produced. The database accession number for each ofthe genes identified by the present inventors is provided in the tablesof the invention. Suitable nucleic acids are also produced byamplification of template, such as by polymerase chain reaction or invitro transcription.

[0137] Synthesized oligonucleotide arrays are particularly preferred forthis aspect of the invention. Oligonucleotide arrays have numerousadvantages, as opposed to other methods, such as efficiency ofproduction, reduced intra- and inter array variability, increasedinformation content and high signal-to-noise ratio.

[0138] One of skill in the art will appreciate that an enormous numberof array designs are suitable for the practice of this invention. Anarray will typically include a number of probes that specificallyhybridize to the sequences of interest. In addition, in a preferredembodiment, the array will include one or more control probes. Thehigh-density array chip includes “test probes.” Test probes could beoligonucleotides that range from about 5 to about 45 or 5 to about 500nucleotides, more preferably from about 10 to about 40 nucleotides andmost preferably from about 15 to about 40 nucleotides in length. Inother particularly preferred embodiments the probes are 20 or 25nucleotides in length. In another preferred embodiments, test probes aredouble or single strand DNA sequences. DNA sequences are isolated orcloned from natural sources or amplified from natural sources usingnatural nucleic acids as templates, or produced synthetically. Theseprobes have sequences complementary to particular subsequences of thegenes whose expression they are designed to detect. Thus, the testprobes are capable of specifically hybridizing to the target nucleicacid they are to detect.

[0139] Another embodiment of the present invention relates to aplurality of antibodies, or antigen binding fragments thereof, for thedetection of the expression of genes regulated by progesterone receptorsin breast tissue. The plurality of antibodies, or antigen bindingfragments thereof, consists of antibodies, or antigen binding fragmentsthereof, that selectively bind to proteins encoded by genes that areregulated by progesterone receptors. In addition, the plurality ofantibodies, or antigen binding fragments thereof, comprises antibodies,or antigen binding fragments thereof, that selectively bind to proteinsencoded by genes selected from the group consisting of: (a) genes thatare selectively upregulated by PR-A chosen from genes in Table 1; (b)genes that are selectively downregulated by PR-A chosen from genes inTable 2; (c) genes that are selectively upregulated by PR-B chosen fromgenes in Table 3; (d) genes that are selectively downregulated by PR-Bchosen from genes in Table 4; (e) genes that are upregulated ordownregulated by both PR-A and PR-B chosen from genes in Table 5; (f)genes that are reciprocally regulated by PR-A and PR-B chosen from genesin Table 6; and, (g) genes that are regulated by one of the PR-A or thePR-B, wherein regulation of the gene is altered when the other of thePR-A or PR-B is expressed by the same cell, chosen from genes in Table7.

[0140] In one aspect, the plurality of antibodies, or antigen bindingfragments thereof, further comprises at least one antibody, or anantigen binding fragment thereof, that selectively binds to a proteinencoded by a gene chosen from the genes in Table 8.

[0141] The plurality of antibodies, or antigen binding fragmentsthereof, further comprises at least one antibody, or an antigen bindingfragment thereof, that selectively binds to a protein encoded by a oneor more of a particular subset of the genes disclosed in the presentinvention. For example, one of skill in the art may wish to designpluralities of antibodies on the basis of the function of the geneproduct, on the basis of tissue-type, on the basis of PR isoformspecificity, on the basis of association with a particular condition ordisease, or on the basis of the change in the level of expression of thegene when in the presence of progesterone. Such embodiments havegenerally been described above.

[0142] According to the present invention, a plurality of antibodies, orantigen binding fragments thereof, refers to at least 2, and morepreferably at least 3, and more preferably at least 4, and morepreferably at least 5, and more preferably at least 6, and morepreferably at least 7, and more preferably at least 8, and morepreferably at least 9, and more preferably at least 10, and so on, inincrements of one, up to any suitable number of antibodies, or antigenbinding fragments thereof, including at least 100, 500, or at least 1000antibodies, or antigen binding fragments thereof.

[0143] According to the present invention, the phrase “selectively bindsto” refers to the ability of an antibody, antigen binding fragment orbinding partner of the present invention to preferentially bind tospecified proteins (e.g., a protein encoded by a PR regulated geneaccording to the present invention). The phrase “selectively binds” withregard to antibodies and antigen binding fragments thereof, has beendefined previously herein.

[0144] Limited digestion of an immunoglobulin with a protease mayproduce two fragments. An antigen binding fragment is referred to as anFab, an Fab′, or an F(ab′)₂ fragment. A fragment lacking the ability tobind to antigen is referred to as an Fc fragment. An Fab fragmentcomprises one arm of an immunoglobulin molecule containing a L chain(V_(L)+C_(L) domains) paired with the V_(H) region and a portion of theC_(H) region (CH1 domain). An Fab′ fragment corresponds to an Fabfragment with part of the hinge region attached to the CH1 domain. AnF(ab′)₂ fragment corresponds to two Fab′ fragments that are normallycovalently linked to each other through a di-sulfide bond, typically inthe hinge regions.

[0145] Isolated antibodies of the present invention can include serumcontaining such antibodies, or antibodies that have been purified tovarying degrees. Whole antibodies of the present invention can bepolyclonal or monoclonal. Alternatively, functional equivalents of wholeantibodies, such as antigen binding fragments in which one or moreantibody domains are truncated or absent (e.g., Fv, Fab, Fab′, or F(ab)₂fragments), as well as genetically-engineered antibodies or antigenbinding fragments thereof, including single chain antibodies orantibodies that can bind to more than one epitope (e.g., bi-specificantibodies), or antibodies that can bind to one or more differentantigens (e.g., bi- or multi-specific antibodies), may also be employedin the invention.

[0146] Generally, in the production of an antibody, a suitableexperimental animal, such as, for example, but not limited to, a rabbit,a sheep, a hamster, a guinea pig, a mouse, a rat, or a chicken, isexposed to an antigen against which an antibody is desired. Typically,an animal is immunized with an effective amount of antigen that isinjected into the animal. An effective amount of antigen refers to anamount needed to induce antibody production by the animal. The animal'simmune system is then allowed to respond over a pre-determined period oftime. The immunization process can be repeated until the immune systemis found to be producing antibodies to the antigen. In order to obtainpolyclonal antibodies specific for the antigen, serum is collected fromthe animal that contains the desired antibodies (or in the case of achicken, antibody can be collected from the eggs). Such serum is usefulas a reagent. Polyclonal antibodies can be further purified from theserum (or eggs) by, for example, treating the serum with ammoniumsulfate.

[0147] Monoclonal antibodies may be produced according to themethodology of Kohler and Milstein (Nature 256:495-497, 1975). Forexample, B lymphocytes are recovered from the spleen (or any suitabletissue) of an immunized animal and then fused with myeloma cells toobtain a population of hybridoma cells capable of continual growth insuitable culture medium. Hybridomas producing the desired antibody areselected by testing the ability of the antibody produced by thehybridoma to bind to the desired antigen.

[0148] Finally, PR-regulated genes of this invention, or their RNA orprotein products, can serve as targets for therapeutic strategies. Forexample, neutralizing antibodies could be directed against one of theprotein products of a selected gene, expressed on the surface of a tumorcell.

[0149] One embodiment of this aspect of the invention relates to amethod to regulate the expression of a gene selected from the groupconsisting of any one or more of the genes in Tables 1-7. The methodincludes administering to a cell that expresses a progesterone receptora compound selected from the group consisting of: progesterone, aprogestin, and an antiprogestin, wherein the compound is effective toregulate the expression of the gene(s) in Table 1-7. In a preferredembodiment, the gene is selected from the group consisting of genes thatare listed in Table 16 (known to be involved in breast cancer or mammarygland development), but not in Table 8 (known to be regulated byprogesterone). Such genes include, e.g., growth arrest-specific protein(gas6), NF-IL6-beta (C/EBPbeta), calcium-binding protein S100P, MSX-2,selenium-binding protein (hSBP), and bullous pemphigoid antigen (plakinfamily). In this aspect of the invention, the cell that expresses aprogesterone receptor is in the breast tissue of a patient that has, oris at risk of developing, breast cancer. In addition to administering aprogestin to the cell, these genes can serve as targets for thedevelopment of other therapeutic methods.

[0150] Once a suitable therapeutic compound, including a progesteronereceptor agonist or antagonist, is identified using the methods andgenes of the present invention, a composition can be formulated. Acomposition, and particularly a therapeutic composition, of the presentinvention generally includes the therapeutic compound (e.g., theprogesterone receptor regulatory ligand) and a carrier, and preferably,a pharmaceutically acceptable carrier. According to the presentinvention, a “pharmaceutically acceptable carrier” includespharmaceutically acceptable excipients and/or pharmaceuticallyacceptable delivery vehicles, which are suitable for use inadministration of the composition to a suitable in vitro, ex vivo or invivo site. A suitable in vitro, in vivo or ex vivo site is preferably acell that expresses a progesterone receptor. In some embodiments, asuitable site for delivery is a site of inflammation, a site of a tumor,a site of a transplanted graft, or a site of any other disease orcondition in which progesterone receptor regulation, or modulation ofgenes regulated by a PR, can be beneficial, particularly given theknowledge of the genes regulated by PR according to the invention.Preferred pharmaceutically acceptable carriers are capable ofmaintaining a steroidal or non-steroidal compound, a protein, a peptide,nucleic acid molecule or mimetic (drug) according to the presentinvention in a form that, upon arrival of the steroidal or non-steroidalcompound, protein, peptide, nucleic acid molecule or mimetic at the celltarget in a culture or in patient, the steroidal or non-steroidalcompound, protein, peptide, nucleic acid molecule or mimetic is capableof interacting with its target (e.g., a naturally occurring PR or anucleic acid or protein product of a PR-regulated gene).

[0151] Suitable excipients of the present invention include excipientsor formularies that transport or help transport, but do not specificallytarget a composition to a cell (also referred to herein as non-targetingcarriers). Examples of pharmaceutically acceptable excipients include,but are not limited to water, phosphate buffered saline, Ringer'ssolution, dextrose solution, serum-containing solutions, Hank'ssolution, other aqueous physiologically balanced solutions, oils, estersand glycols. Aqueous carriers can contain suitable auxiliary substancesrequired to approximate the physiological conditions of the recipient,for example, by enhancing chemical stability and isotonicity.

[0152] Suitable auxiliary substances include, for example, sodiumacetate, sodium chloride, sodium lactate, potassium chloride, calciumchloride, and other substances used to produce phosphate buffer, Trisbuffer, and bicarbonate buffer. Auxiliary substances can also includepreservatives, such as thimerosal, m- or o-cresol, formalin and benzolalcohol. Compositions of the present invention can be sterilized byconventional methods and/or lyophilized.

[0153] One type of pharmaceutically acceptable carrier includes acontrolled release formulation that is capable of slowly releasing acomposition of the present invention into a patient or culture. As usedherein, a controlled release formulation comprises a compound of thepresent invention (e.g., a protein (including homologues), a drug, anantibody, a nucleic acid molecule, or a mimetic) in a controlled releasevehicle. Suitable controlled release vehicles include, but are notlimited to, biocompatible polymers, other polymeric matrices, capsules,microcapsules, microparticles, bolus preparations, osmotic pumps,diffusion devices, liposomes, lipospheres, and transdermal deliverysystems. Other carriers of the present invention include liquids that,upon administration to a patient, form a solid or a gel in situ.Preferred carriers are also biodegradable (i.e., bioerodible). When thecompound is a recombinant nucleic acid molecule, suitable deliveryvehicles include, but are not limited to liposomes, viral vectors orother delivery vehicles, including ribozymes. Natural lipid-containingdelivery vehicles include cells and cellular membranes. Artificiallipid-containing delivery vehicles include liposomes and micelles. Adelivery vehicle of the present invention can be modified to target to aparticular site in a patient, thereby targeting and making use of acompound of the present invention at that site. Suitable modificationsinclude manipulating the chemical formula of the lipid portion of thedelivery vehicle and/or introducing into the vehicle a targeting agentcapable of specifically targeting a delivery vehicle to a preferredsite, for example, a preferred cell type. Other suitable deliveryvehicles include gold particles, poly-L-lysine/DNA-molecular conjugates,and artificial chromosomes.

[0154] A pharmaceutically acceptable carrier which is capable oftargeting is herein referred to as a “delivery vehicle.” Deliveryvehicles of the present invention are capable of delivering acomposition of the present invention to a target site in a patient. A“target site” refers to a site in a patient to which one desires todeliver a composition. For example, a target site can be any cell whichis targeted by direct injection or delivery using liposomes, viralvectors or other delivery vehicles, including ribozymes and antibodies.Examples of delivery vehicles include, but are not limited to,artificial and natural lipid-containing delivery vehicles, viralvectors, and ribozymes. Natural lipid-containing delivery vehiclesinclude cells and cellular membranes. Artificial lipid-containingdelivery vehicles include liposomes and micelles. A delivery vehicle ofthe present invention can be modified to target to a particular site ina subject, thereby targeting and making use of a compound of the presentinvention at that site. Suitable modifications include manipulating thechemical formula of the lipid portion of the delivery vehicle and/orintroducing into the vehicle a compound capable of specificallytargeting a delivery vehicle to a preferred site, for example, apreferred cell type. Specifically, targeting refers to causing adelivery vehicle to bind to a particular cell by the interaction of thecompound in the vehicle to a molecule on the surface of the cell.Suitable targeting compounds include ligands capable of selectively(i.e., specifically) binding another molecule at a particular site.Examples of such ligands include antibodies, antigens, receptors andreceptor ligands. Manipulating the chemical formula of the lipid portionof the delivery vehicle can modulate the extracellular or intracellulartargeting of the delivery vehicle. For example, a chemical can be addedto the lipid formula of a liposome that alters the charge of the lipidbilayer of the liposome so that the liposome fuses with particular cellshaving particular charge characteristics.

[0155] One preferred delivery vehicle of the present invention is aliposome. A liposome is capable of remaining stable in an animal for asufficient amount of time to deliver a nucleic acid molecule (e.g., ananti-sense nucleic acid molecule that hybridizes to a nucleic acidsequence in a gene for which inhibition is desired) to a preferred sitein the animal. A liposome, according to the present invention, comprisesa lipid composition that is capable of delivering a nucleic acidmolecule described in the present invention to a particular, orselected, site in a patient. A liposome according to the presentinvention comprises a lipid composition that is capable of fusing withthe plasma membrane of the targeted cell to deliver a nucleic acidmolecule into a cell. Suitable liposomes for use with the presentinvention include any liposome. Preferred liposomes of the presentinvention include those liposomes commonly used in, for example, genedelivery methods known to those of skill in the art. More preferredliposomes comprise liposomes having a polycationic lipid compositionand/or liposomes having a cholesterol backbone conjugated topolyethylene glycol. Complexing a liposome with a nucleic acid moleculeof the present invention can be achieved using methods standard in theart.

[0156] A liposome delivery vehicle is preferably capable of remainingstable in a patient for a sufficient amount of time to deliver a nucleicacid molecule or other compound of the present invention to a preferredsite in the patient (i.e., a target cell). A liposome delivery vehicleof the present invention is preferably stable in the patient into whichit has been administered for at least about 30 minutes, more preferablyfor at least about 1 hour and even more preferably for at least about 24hours. A preferred liposome delivery vehicle of the present invention isfrom about 0.01 microns to about 1 microns in size.

[0157] Another preferred delivery vehicle comprises a viral vector. Aviral vector includes an isolated nucleic acid molecule useful in thepresent invention, in which the nucleic acid molecules are packaged in aviral coat that allows entrance of DNA into a cell. A number of viralvectors can be used, including, but not limited to, those based onalphaviruses, poxviruses, adenoviruses, herpesviruses, lentiviruses,adeno-associated viruses and retroviruses.

[0158] A composition which includes an agonist or antagonist of aprogesterone receptor can be delivered to a cell culture or patient byany suitable method. Selection of such a method will vary with the typeof compound being administered or delivered (i.e., steroidal ornon-steroidal compound, protein, peptide, nucleic acid molecule, ormimetic), the mode of delivery (i.e., in vitro, in vivo, ex vivo) andthe goal to be achieved by administration/delivery of the compound orcomposition. According to the present invention, an effectiveadministration protocol (i.e., administering a composition in aneffective manner) comprises suitable dose parameters and modes ofadministration that result in delivery of a composition to a desiredsite (i.e., to a desired cell) and/or in the desired regulatory event(e.g., regulation of the PR receptor biological activity or of thebiological activity of a gene that is regulated by PR).

[0159] Administration routes include in vivo, in vitro and ex vivoroutes. In vivo routes include, but are not limited to, oral, nasal,intratracheal injection, inhaled, transdermal, rectal, and parenteralroutes. Preferred parenteral routes can include, but are not limited to,subcutaneous, intradermal, intravenous, intramuscular andintraperitoneal routes. Intravenous, intraperitoneal, intradermal,subcutaneous and intramuscular administrations can be performed usingmethods standard in the art. Aerosol (inhalation) delivery can also beperformed using methods standard in the art (see, for example, Striblinget al., Proc. Natl. Acad. Sci. USA 189:11277-11281, 1992, which isincorporated herein by reference in its entirety). Oral delivery can beperformed by complexing a therapeutic composition of the presentinvention to a carrier capable of withstanding degradation by digestiveenzymes in the gut of an animal. Examples of such carriers, includeplastic capsules or tablets, such as those known in the art. Directinjection techniques are particularly useful for suppressing graftrejection by, for example, injecting the composition into thetransplanted tissue, or for site-specific administration of a compound,such as at the site of a tumor. Ex vivo refers to performing part of theregulatory step outside of the patient, such as by transfecting apopulation of cells removed from a patient with a recombinant moleculecomprising a nucleic acid sequence encoding a protein according to thepresent invention under conditions such that the recombinant molecule issubsequently expressed by the transfected cell, and returning thetransfected cells to the patient. In vitro and ex vivo routes ofadministration of a composition to a culture of host cells can beaccomplished by a method including, but not limited to, transfection,transformation, electroporation, microinjection, lipofection,adsorption, protoplast fusion, use of protein carrying agents, use ofion carrying agents, use of detergents for cell permeabilization, andsimply mixing (e.g., combining) a compound in culture with a targetcell.

[0160] In the method of the present invention, a therapeutic compound,including agonists and antagonists of progesterone receptors, as well ascompositions comprising such compounds, can be administered to anyorganism, and particularly, to any member of the Vertebrate class,Mammalia, including, without limitation, primates, rodents, livestockand domestic pets. Livestock include mammals to be consumed or thatproduce useful products (e.g., sheep for wool production). Preferredmammals to protect include humans. Typically, it is desirable tomodulate (e.g., regulate (up or down)) progesterone receptor biologicalactivity or the biological activity of a gene regulated by a PR, toobtain a therapeutic benefit in a patient. A therapeutic benefit is notnecessarily a cure for a particular disease or condition, but rather,preferably encompasses a result which can include alleviation of thedisease or condition, elimination of the disease or condition, reductionof a symptom associated with the disease or condition, prevention oralleviation of a secondary disease or condition resulting from theoccurrence of a primary disease or condition, and/or prevention of thedisease or condition. As used herein, the phrase “protected from adisease” refers to reducing the symptoms of the disease; reducing theoccurrence of the disease, and/or reducing the severity of the disease.Protecting a patient can refer to the ability of a composition of thepresent invention, when administered to a patient, to prevent a diseasefrom occurring and/or to cure or to alleviate disease symptoms, signs orcauses. As such, to protect a patient from a disease includes bothpreventing disease occurrence (prophylactic treatment) and treating apatient that has a disease (therapeutic treatment) to reduce thesymptoms of the disease. A beneficial effect can easily be assessed byone of ordinary skill in the art and/or by a trained clinician who istreating the patient. The term, “disease” refers to any deviation fromthe normal health of a mammal and includes a state when disease symptomsare present, as well as conditions in which a deviation (e.g.,infection, gene mutation, genetic defect, etc.) has occurred, butsymptoms are not yet manifested.

[0161] The following examples are provided for the purpose ofillustration and are not intended to limit the scope of the presentinvention.

EXAMPLES Example 1 The following example describes the identification ofgenes regulated by progesterone receptors.

[0162] Materials and Methods

[0163] Cell Culture

[0164] Wild-type PR-positive T47Dco breast cancer cell line and itsclonal derivatives T47D-Y, T47D-YA and T47D-YB, have been described(Horwitz et al., Cell 28, 633-42 (1982); Sartorius et al., Cancer Res.54, 3668-3877 (1994)). Briefly, cells are routinely cultured in 75 cm²plastic flasks and incubated in 5% CO₂ at 37° C. in a humidifiedenvironment. The stock medium consists of Eagle's Minimum EssentialMedium with Earle's salts (MEM), containing L-glutamine (292 mg/liter)buffered with sodium bicarbonate (2.2 g/liter), insulin (6 ng/ml) and 5%fetal bovine serum (Hyclone, Logan, Utah) with G418.

[0165] Arrays

[0166] Atlas™Human cDNA Expression Array. T47D-YA and T47D-YB breastcancer cells were grown to mid-confluence in Minimal Essential Mediumcontaining 5% Fetal Calf Serum, then either treated with 10 nMprogesterone dissolved in ethanol for 6 or 12 hours, or in ethanolalone. This yielded 4 treatment types. Total RNA was prepared from the 4sets of cells using guanidinium isothiocyanate, polyA⁺ RNA was purifiedwith the Oligotex mRNA Kit (Qiagen, Valencia, Calif.), and ³²P-labeledcDNA was synthesized from 1 ug of each sample using SuperScriptIIreverse transcriptase (Gibco BRL Life Technologies, Gaithersburg, Md.).Labeled probes were separately hybridized to Atlas™ Human cDNAExpression Arrays (Clontech, Palo Alto, Calif.) consisting of nylonmembranes onto which 588 cDNA fragments encoding known proteins werespotted in duplicate. After a high stringency wash, hybridization wasdetected by autoradiography and phosphoimaging on a MolecularDynamicsPhosphoImager™ (Molecular Dynamics, Sunnyvale, Calif.). Datawere analyzed using Atlas™ Image 1.0, and normalized to signals fromcontrol housekeeping genes on the same filter. For selected genes,progesterone inducibility and PR-isoform specificity were confirmed bynorthern blotting, reverse transcriptase-polymerase chain reaction(RT-PCR), and/or western blotting.

[0167] Affymetrix GeneChip™Array. T47D-Y, T47D-YA and T47D-YB breastcancer cells were grown to mid-confluence in Minimal Essential Mediumcontaining 5% Fetal Calf Serum, then either treated with 10 nMprogesterone dissolved in ethanol for 6 hours, or in ethanol alone. Thisyielded 6 treatment types. Total RNA and polyA⁺ RNA were prepared fromthe 6 sets, as described above. First strand cDNA was synthesized from 2ug of polyA⁺ RNA using SSII Reverse Transcriptase, the T7dT 24mer, andother components of the Superscript Choice system (Gibco BRL LifeTechnologies, Gaithersburg, Md.). Following second strand synthesis, theDNA was purified by phenol/chloroform extraction and precipitation, andresuspended in 12 ul DEPC-treated RNase water. 5 ul were used in an invitro transcription reaction using the EnZo BioArray™ High Yieldtranscript Labeling Kit (Affymetrix, Inc., Santa Clara, Calif.), tosynthesize RNA transcripts and incorporate biotin labeledribonucleotides. Unincorporated nucleotides were removed with RNeasyaffinity columns (Qiagen, Valencia, Calif.). Purified, biotinylatedcRNAs were quantified, and 20 ug were subjected to a fragmentationreaction by incubation at 94C for 35 min (Affymetrix™ protocol 700218)to randomly generate fragments ranging from 35 to 200 bases. HuGeneFLArray™ chips consisting of 5,600 full-length human genes from Unigene,Genebank and TIGR databases were used for hybridization. Thirty μl offragmented cRNA were added to a hybridization mixture (100 mM MES, 1 MNaCl, 20 mMEDTA, and 0.01% Tween 20) and control oligonucleotide B2 andcontrol cRNA cocktail, as described in the Affymetrix™ protocol.Hybridizations and subsequent washes were done in the GeneChipHybridization Oven and Fluidics Station 400. After overnighthybridization, the solutions were removed, the chips were washed andstained with streptavidin-phycoerythrin. DNA chips were read at aresolution of 6 um with a Hewlett-Packard GeneArray Scanner.

[0168] Each gene on the chip is represented by perfectly matched (PM)and mismatched (MM) oligonucleotides from 16-20 regions of each gene.The mismatched probes act as specificity controls, which allow directsubtraction of background and cross-hybridization signals. The number ofinstances in which the PM hybridization signal is larger that the MMsignal is computed along with the average of the logarithm of the PM:MMratio (after background subtraction) for each probe set. These valueswere used to arrive at a matrix-based decision concerning the presenceor absence of an RNA transcript. Detailed protocols for data analyses ofAffymetrix microarrays and extensive documentation of the sensitivityand quantitative aspects of the method have been described. Briefly, thefirst level of analysis including the “present” or “absent” call, andpairwise comparisons, were done using GeneChip 3.1 Expression AnalysisProgram™ (Affymetrix, Inc., Santa Clara, Calif.). A second level ofanalysis to identify clusters of genes regulated by progesterone viaPR-A, PR-B or both was performed using GeneSpring™ version 3.0 (SiliconGenetics, San Carlos, Calif.). The present inventors used customizedsoftware capable of comparing multiple experimental pairwise comparisons(minus versus plus progesterone) and multiple control comparisons (allminus hormone samples and all plus hormone samples) to compare foldchange minus versus plus hormone as compared to the fold change betweencontrols. This served as a measure of the variability between samples.As a third level of analysis, k-means clustering was performed usingGeneSpring™ version 3.2.12 (Silicon Genetics, San Carlos, Calif.) toidentify patterns of gene regulation in PR-A, PR-B, or PR-negative cellstreated with or without progesterone.

[0169] Selected genes, i.e., ones that were substantially regulated orare of particular biological interest, have been confirmed by northernand/or RT-PCR, and/or by western blotting. Additionally, the promotersof several genes of interest have been cloned, linked upstream of aluciferase reporter, and tested for their ability to betranscriptionally regulated by PR-A vs. PR-B after transfection intoHeLa cervicocarcinoma cells, followed by progesterone treatment of thecells. In the examples tested, regulation by PR-A vs. PR-B using thesynthetic promoter/reporter constructs, mimicked the regulation of theendogenous genes in the breast cancer cells, supporting the use of theseapproaches for drug discovery.

[0170] RT-PCR and Northern Blot Analysis

[0171] RT-PCR amplifications of target sequences were performed withco-amplification of an internal control sequence (p2MG or GAPDH) using:P2MG forward primer: 5′-ATCCAGCGTACTCCAAAGATTC-3′ (SEQ ID NO:1); β2MGreverse primer: 5′-TCCTTGCTGAAAGACAAGTCTG-3′ (SEQ ID NO:2); resulting ina product of 178 bp. GAPDH primers yielded a product of 485 bp. GAPDH,Integrin α6, and bcl-x cDNA primer sequences were obtained fromClontech. Total RNA was prepared from T47DY-A or -B cells as describedabove. One μg of RNA was mixed with 0.4 μM random hexamers and heated to65° C. for five min. (Perkin Elmer). 1×PCR buffer (5 mM MgCl₂), 20 URNAse inhibitor, 4 mM dNTPs, and 125 U MMLV reverse transcriptase wereadded and tubes were incubated at 42° C. for 1 hour. Five μl of the cDNAsynthesis reactions were added to 1×PCR buffer, 1.8 mM MgCl₂, 100 mMdNTP blend, and 60 pmoles of specific primers were incubated with 5 UAmpliTaq DNA polymerase at 94° C. for 30 s, 65 C for 45 s, and 68° C.for 1 min for 16-18 cycles (cycle number was chosen to be in the linearrange of amplification for each product). All PCR reagents werepurchased from Perkin Elmer, Foster City, Calif. Five μl of samples wereresolved on a 2% agarose gel, and Southern blots were performed in 0.4M.Blots were prehybridized in Rapid-hyb (Amersham) for 1 h at 65° C. cDNAprobes were generated by RT-PCR and radioactively labeled usingMegaPrime DNA labeling system (Amersham) and ³²P-αdCTP. Blots wereprobed for 2 h to overnight at 65° C. Blots were washed and exposed toautoradiography film or phosphoimaging screen and then quantified usingImageQuant, Molecular Dynamics. In some cases the RT-PCR products couldbe visualized on an ethidium bromide stained gel when amplified in thelinear range of production and in these cases Southern blotting andhybridizing with a labeled probe was unnecessary and products wereinstead directly quantitated. In some cases Northern blot analysis wasused to detect transcripts. In these cases 25 μg of total RNA waselectrophoresed in a formaldehyde agarose gel and transferred to aHybond nylon membrane (Amersham) and hybridized sequentially with cDNAinserts for specific genes generated by random priming PCR productsgenerate as above with ³²P dCTP using Mega-Prime DNA Labeling Kit(Amersham). Membranes were then probed with fragments of housekeepinggenes (either B2MG or GAPDH).

[0172] Transcriptional Assay:

[0173] HeLa cells plated at 4×10⁵ cells per 10 cm dish in MEMsupplemented with 5% fetal bovine serum were then transientlytransfected with 100 ng of HPR1 (PR-B in pSG5) or HPR2 (PR-A in pSG5)and 1.2 μg of the integrin a6 promoter (−740) in pGL3-Basic vectorplasmid (gift from Dr. Sohei Kitazawa, Kobe University School ofMedicine, Department of Pathology), 1.2 μg of β-galactosidase expressionplasmid pCH110, and 5.5 μg BSM treated with 10 mM progesterone orethanol vehicle for 24 hours.

[0174] Immunoblots:

[0175] For time course treatments with progesterone, cells were platedat 2 million cells per large plates in MEM with supplements describedabove and were treated with 10 nM progesterone (Sigma). Cells wereharvested in RIPA buffer (10 mM sodium phosphate, pH 7.0, 150 mM NaCl, 2mM EDTA, 1% deoxycholic acid, 1% Nonedet P-40, 0.1% SDS, 0.1%β-mercaptoethanol, 1 mM PMSF, 50 mM sodium fluoride, 200 μM Va₃VO₄, andone Complete Protease Inhibitor Mixture tablet (Boehringer Marnheim,GmbH Germany) per 50 mls of RIPA buffer made fresh for each use. Proteinextracts were equalized to 150 μg by Bradford assay (Bio-Rad), resolvedby SDS-PAGE, and transferred to nitrocellulose. Equivalent proteinloading was confirmed by Ponceau S staining. Following incubation withthe appropriate antibodies, and HRP-conjugated secondary antibodies,protein bands were detected by enhanced chemiluminescence (Amersham,Arlington Heights, Ill.).

[0176] Results

[0177] Gene expression data from Affymetrix HuGeneFL Array™ chips wereanalyzed using Microarray Suite 4.0 Expression Analysis Program(Affymetrix™). Experimental data from independent triplicate experimentsfor T47D-YA and T47D-YB cells and duplicate T47D-Y cells treated with orwithout 10 nM progesterone were analyzed and pairwise comparisons wereperformed to identify genes that had increased or decreased withaddition of hormone. These data were imported into Microsoft Excel andcustom formulas were written to identify genes that had repeatedlyincreased or decreased with hormone in three out of three experiments byat least 1.8 fold, but did not vary more than two fold between controlgroups. Genes that met these criteria and were up- or downregulated byprogesterone by in PR-B containing cells are shown in Table 18, whilethose up- or downregulated by progesterone in PR-A containing cells areshown in Table 19. In both tables fold increases and decreases (negativenumbers) upon treatment with progesterone for 6 hrs are indicated. Geneswhich were at below detectable levels and called absent in one sample,but which were detectable and called as present in the other are denotedwith a tilde beside the fold changes. The fold changes indicated with atilde cannot be compared to those that are not marked with a tilde(indicating they were present in both minus and plus hormone samples) asthe fold change was calculated by setting the undetectable gene tobackground level. Genes in bold in Table 18 are uniquely regulated byprogesterone only via PR-B, while those in bold in Table 19 are uniquelyregulated by PR-A; those not bolded were regulated in both PR-B and PR-Acontaining cells. Only genes that were regulated in 3 out of 3experiments are shown and average fold inductions are given. Genesmarked with an asterisk were identified from Atlas™ Human cDNAExpression Arrays (Clonetech, Palo Alto, Calif.) and those marked by an& symbol were identified as being progesterone regulated on using bothAtlas™ Human cDNA Expression Arrays and Affyetrix HuGeneFL Array™ chips(Affymetrix, Inc., Santa Clara, Calif.), all others were identifiedusing Affymetrix HuGeneFL Array™ chips (Affymetrix, Inc., Santa Clara,Calif.). The present inventors have categorized genes regulated byprogesterone in this study into functional categories based on GeneCardinformation as well as extensive literature reviews of each gene product(Table 17). Ten of the genes found to be regulated by progesterone inthe present study have previously been reported by other groups to beprogesterone responsive in either breast cancer cells or other hormoneresponsive cell types or tissues (Table 8). However, the PR-A and/orPR-B isoform specificity of these genes was unknown prior to the presentinvention. The independent identification of genes that have previouslybeen reported to be progesterone-regulated serves as an internal controland also demonstrates the sensitivity of this assay, as even genesinduced by progesterone as little as 1.9 fold were detected on thearrays. Additionally, 8 of the genes found to be regulated byprogesterone in the present study have previously been reported to beinvolved in either breast cancer or mammary gland development (Table16).

[0178] The average differences indicating relative intensities obtainedfrom triplicate experiments from T47D-YA and T47D-YB cell lines andduplicate experiments in the PR-negative T47D-Y cells were entered intoGeneSpring™ 3.2.12 (Silicon Genetics, San Carlos, Calif.). To normalizefor variation among chips each gene intensity value was normalized to 1(intensity of gene A on chip X divided by the median of all intensitiesmeasured on chip X). To identify patterns of gene expression among celllines and hormone treatments, k-means clustering was performed.Clustergrams of various patterns of gene regulation were generated.Within these clusters, any one gene can be viewed individually andstandard error bars generated from replicate experiments are shown forgene expression levels in cell lines containing either PR-A, PR-B, or noPR, with or without progesterone treatment. A cluster of genes was shownto be upregulated by progesterone in both PR-A and PR-B containingcells, but not in the PR-negative cell line. While most of these geneswere upregulated by progesterone treatment more strongly via PR-B, some,such as S100P calcium binding protein, and Grb10 are upregulated equallywell via PR-A and PR-B. Upregulation of IkappaBalpha via both receptorswas confirmed at the protein level as early as 6 hours, and remainedelevated for up to 48 hours in the presence of progesterone (data notshown). Additionally, the gene encoding Ezrin, identified as beingprogesterone regulated using Atlas™ Human cDNA Expression Arrays probedwith RNA from T47D-YA and YB cells left untreated or treated withprogesterone for 12 hrs was confirmed to be equally well upregulated byboth PR-A and PR-B at 12, 24 and 48 hrs by northern blot analysis (datanot shown).

[0179] The present inventors have demonstrated that although some genes(and their protein products) are regulated by progesterone through bothPR isoforms, many genes are uniquely regulated by either PR-A or PR-B.In the T47D breast cancer cell lines used for the present invention,many more genes were regulated by progesterone through PR-B than throughPR-A. However, it remains to be determined whether this situation isreversed in other types of cells or tissues; the endometrium forinstance. Data from knock-out mice show that PR-A, but not PR-B, playsan important role in opposing the proliferative effect of estrogen onthe endometrium. This is one example of tissue and PR isoformspecificity (Mulac-Jericevic et al., Science 289, 1751-4 (2000)).

[0180] Many progesterone regulated genes require PR-B as illustrated byTables 3, 4, 11, 12 and 18. Two examples are Stat5a and C/EBP beta.Their differential upregulation only by PR-B was confirmed by immunoblotat several time-points after progesterone treatment (data not shown). Incontrast, the same western blot probed for two control proteins, p21 andcyclin D1, previously reported to be progesterone regulated (Musgrove etal., Mol. Cell. Biol. 13, 3577-3587 (1993); Musgrove et al., Mol.Endocrinol. 11, 54-66 (1997); Groshong et al., Mol Endocrinol 11,1593-607 (1997)), showed them to be equally well regulated by eitherPR-A or PR-B. The gene encoding tissue factor is also uniquely regulatedby PR-B. This too was confirmed by RT-PCR. Similarly, RT-PCR confirmedthat integrin alpha 6 is uniquely regulated by PR-B at 6, 12, and 24hours after progesterone treatment. To demonstrate the differentialregulation of this gene by PR-B in a different cell line and bydifferent methods, the present inventors transfected the integrin alpha6 promoter linked to luciferase into progesterone treated PR-negativeHeLa cells that were cotransfected with either PR-B or PR-A.Transcription of the integrin alpha 6 promoter was induced 4.4 fold byPR-B, but was not regulated at all by PR-A, or by cells lacking PR (notshown).

[0181] Fewer genes were uniquely regulated by PR-A (Table 19) and theytended to be expressed at relatively low levels. The gene encoding thedocking protein enhancer of filamentation was significantly upregulatedonly by PR-A. The gene encoding the estrogen related receptor (ERR),which can heterodimerize with ERα and Erβ is also PR-A dependent. Thepreferential upregulation of ERR by PR-A was confirmed by RT-PCR at both6 and 12 hrs of progesterone treatment. The anti-apoptosis inducingprotein Bcl-X_(L), is another gene uniquely regulated by PR-A asconfirmed by RT-PCR (not shown).

[0182] In general, fewer genes were downregulated by progesteronetreatment than were upregulated (Tables 18 and 19). Analysis of pairwisecomparisons using MicroArray Suite 4.0 Expression Analysis Program™ wasused to demonstrate the statistical significance of the downregulation(in 3 out of 3 experiments). Similarly, gene filtering using GeneSpring™generated a clustergram of downregulated genes (data not shown)confirming the accuracy of the assignments. Of the downregulated genes,three were downregulated by both PR-A and PR-B; eleven were uniquelydownregulated by PR-B; and two were uniquely downregulated by PR-A.Downregulation of three of these genes, monocyte chemotactic protein,bullous pemphigoid antigen, and transforming growth factor-beta 3(TGF-beta 3) was confirmed by RT-PCR (data not shown).

[0183] Several genes that were identified by the present inventors asbeing regulated by progesterone, were previously known to be importantin breast cancers. Based on the present invention they may now betargeted for specific progestin therapies. (1) For instance, S100Pcalcium-binding protein overexpression is associated withimmortalization of human breast epithelial cells in vitro and with earlystages of breast cancer development in vivo (Guerreiro da Silva et al.,Int J Oncol 16, 231-40 (2000)). (2) The gene encoding tissue factor, acell surface glycoprotein, is associated with metastasis in breast andother types of cancers (Ueno et al., Br J Cancer 83, 164-70 (2000);Lwaleed et al., J Pathol 187(3):291-4 (1999)). Tissue factor waspreviously known to be regulated by progesterone in the endometrium(Krikun et al., Mol Endocrinol 14, 393-400 (2000); Lockwood et al., JClin Endocrinol Metab 85, 297-301 (2000); Krikun et al., J ClinEndocrinol Metab 83, 926-30 (1998)), but not in the breast or in breastcancers. (3) The gene encoding Gas6, a ligand for the tyrosine kinasereceptor Axl receptor tyrosine kinase (RTK) and other members of the RTKfamily, was recently reported to be mitogenic in breast cancer cells(Goruppi et al., Mol Cell Biol 21, 902-915 (2001)) and it promotesangiogenesis (Fridell et al., J Biol Chem 273, 7123-6. (1998)). (4) TheHEF1 gene is highly related to BCAR1/p130Cas, which has been found to beupregulated in tamoxifen resistant tumors (van der Flier et al., Int JCancer 89, 465-8 (2000); van der Flier et al., J Natl Cancer Inst 92,120-7 (2000)). The present invention provides the rationale formeasuring the expression levels of these genes in breast cancers. It maybe that tumors that overexpress these genes good candidates forsuppressive therapy with progesterone antagonists.

[0184] Additionally the inventors now demonstrate the progesteroneregulation of several genes previously known to be preferentiallyexpressed in normal breast epithelium compared to breast cancers. Forinstance, the gene encoding bullous pemphigoid antigen, a proteinassociated with hemidesmosomes, is overexpressed 12-fold in normalbreast cells compared to breast tumors (Nacht et al., Cancer Res 59,5464-70 (1999)). Such desmosomes are important in maintaining the normaldifferentiated architecture of the breast. The present inventors havefound that bullous pemphigoid antigen is downregulated by progesteronethrough both PR isoforms. This down regulation may be harmful, and/or itmay disrupt important cell-cell interactions. It is possible thatantiprogestin therapy would prevent this downregulation.

[0185] Some of the genes that were discovered by the present inventorsto be progesterone regulated are involved in particular functionalpathways. Groups of temporally regulated genes are often involved in thesame pathway. For example, it was previously known that progesteroneregulates genes involved in the steroid biosynthesis and traffickingpathways (Watari et al., Exp Cell Res 259, 247-56 (2000); Darnel et al.,J Steroid Biochem Mol Biol 70:203-10 (1999); Arcuri et al.,Endocrinology 137:595-600 (1996)), and the present investigatorsidentify a cluster of such genes. However, less is known about the roleof progesterone in regulating signaling pathways controlled by growthfactors and cytokines. The present inventors' data demonstrate for thefirst time, that progesterone plays an important role in regulating manygenes involved in these signaling pathways. In addition, the presentinventors' demonstrate that progesterone regulates expression of genesfor proteins previously known to interact with PR. Examples are FKB54(Kester et al., J Biol Chem 272, 16637-43 (1997)), Stat5 (Richeretal., JBiol Chem 273, 31317-26 (1998)), IκBα and cytoplasmic dynein light chain1 (Crepieux et al., Mol Cell Biol 17:7375-85 (1997)). TABLE 1 Genesselectively upregulated by PR-A Accession No. Fold Increase Gene NameL43821 4.7 enhancer of filamentation (HEF1) L38487 2.3 estrogenreceptor-related protein (hERRa1)

[0186] TABLE 2 Genes selectively downregulated by PR-A Accession No.Fold Decrease Gene Name U44103 −2.9 small GTP binding protein Rab9

[0187] TABLE 3 Genes selectively upregulated by PR-B. Accession Fold No.Increase Gene Name L13720 ˜23.1 growth arrest-specific protein (gas6)M27436 ˜18.1 tissue factor gene D79990 10.2 KIAA0168 Ras association(RalGDS/AF-6) domain family 2 (RASSF2) U01120 ˜9.8 glucose-6-phosphataseD25539 ˜8 KIAA0040 gene U37546 ˜7.2 IAP homolog C (MIHC) D87953 6.8 RTP,DRG1, CAP43 M76180 ˜6.5 aromatic amino acid decarboxylase (ddc) M77140˜6 pro-galanin D50840 ˜5.6 ceramide glucosyltransferase HG2743- ˜5.1Caldesmon 1 Non-Muscle HT2846 U76421 ˜4.7 dsRNA adenosine deaminaseDRADA2b U40572 4.6 beta2-syntrophin (SNT B2) S69189 ˜4.5 peroxisomalacyl-coenzyme A oxidase U44754 4.4 PSE-binding factor PTF gamma subunitU02081 4.1 guanine nucleotide regulatory protein (NET1) oncogene¹ D16227˜4 BDP-1 (member of the recoverin family) D17793 ˜4 3-alphahydroxysteroid dehydrogenase type IIb U83461 3.7 putative copper uptakeprotein (hCTR2) M23254 3.6 Ca2+-activated neutral protease (CANP) D150503.6 transcription factor AREB6 HG2167- ˜3.5 Protein Kinase Ht31,Camp-Dependent HT2237 D10040 3.5 long-chain acyl-CoA synthetase D318873.5 KIAA0062 gene X60673 3.4 adenylate kinase 3 U45878 ˜3.3 inhibitor ofapoptosis protein 1 L09229 3.3 long-chain acyl-coenzyme A synthetase(FACL1) U09646 3.2 carnitine palmitoyltransferase II precursor (CPT1)D31716 3.2 GC box bindig protein M37400 3.1 cytosolic aspartateaminotransferase X59834 3.1 glutamine synthase D78335 3.1 uridinemonophosphate kinase (UMPK) U41387 3 RNA helicase II/Gu) U07919 3aldehyde dehydrogenase 6 M69013 2.9 guanine nucleotide-bindingregulatory protein (G-y-alpha)¹ HG2530- 2.9 Adenylyl Cyclase-AssociatedProtein 2 HT2626 U79288 2.8 clone 23682 D10704 2.6 choline kinase Y081342.6 ASM-like phosphodiesterase 3b U33632 2.6 two P-domain K+ channelTWIK-1 M21154 2.5 S-adenosylmethionine decarboxylase U77949 2.5Cdc6-related protein (HsCDC6) M95767 ˜2.5 di-N-acetylchitobiase D837812.5 KIAA0197 gene X98534 2.5 vasodilator-stimulated phosphoprotein(VASP) X53586 2.5 Integrin α 6* D80001 2.4 KIAA0179 gene L18960 2.4protein synthesis factor (elF-4C) D23673 2.3 insulin receptorsubstrate-1 (IRS-1) J02888 2.3 quinone oxidoreductase (NQO2) D63487 2.3KIAA0153 gene U14603 2.3 protein-tyrosine phosphatase (HU-PP-1) L418872.3 splicing factor, arginine/serine-rich 7 (SFRS7) M92287 2.2 cyclin D3(CCND3) X61123 2.2 BTG1 M95929 2.1 homeobox protein (PHOX1) U32944 2.1cytoplasmic dynein light chain 1 (hdlc1) D79994 2.1 KIAA0172 gene(similar to ankyrin) D89377 2 MSX-2 U90878 2 LIM domain protein CLP-36U97105 2 N2A3 dihydropyrimidinase related protein-2 L40379 2 thyroidreceptor interactor (TRIP10) J05459 1.9 glutathione transferase M3(GSTM3) L42542 1.8 RLIP76 (ralA binding protein 1) D42047 1.7 KIAA0089similar to glycerol-3- phosphate dehydrogenase 1 M84349 1.7transmembrane protein (CD59) D43950 1.6 KIAA0098 T-COMPLEX PROTEIN 1(TCP-1- EPSILON) M15796 1.6 proliferating cell nuclear antigen (PCNA)

[0188] TABLE 4 Genes selectively downregulated by PR-B Accession No.Fold Decrease Gene Name U07225 ˜−4.3 P2U nucleotide receptor M27492˜−3.4 interleukin 1 receptor mRNA Y08682 −3.1 carnitinepalmitoyltransferase|type| U29091 ˜−2.9 selenium-binding protein (hSBP)X79683 −2.6 beta2 laminin. AB000220 −2.6 semaphorin E¹ HG2197-HT2267˜−2.5 Collagen, Type Vii, Alpha 1 U65011 ˜−2.5 preferentially expressedantigen of melanoma (PRAME) M18391 ˜−2.3 tyrosine kinase receptor (eph)X71874 −1.9 proteasome-like subunit MECL-1

[0189] TABLE 5 Genes up or downregulated by both PR-A and PR-B AccessionNo. Fold Gene Name X51521 ˜22.6 Ezrin* U70663 ˜7.5 zinc fingertranscription factor EZF U16799 6.1 Na, K-ATPase beta-1 subunit X656143.6 calcium-binding protein S100P D86962 2.9 Grb10 S81914 2.6 IEX-1 =radiation-inducible immediate-early U00115 2.4 bcl-6 M69225 ˜−3.5bullous pemphigoid antigen (plakin family) U90907 −3.2 clone 23907M92357 −2.1 tumor necrosis factor alpha-induced protein 2 (B94)

[0190] TABLE 6 Gene that is reciprocally regulated (upregulated by PR-B,downregulated by PR-A) Accession No. Fold Gene Name X53586 2.5 Integrinα 6*

[0191] TABLE 7 Group of genes for which the expression level isdifferent depending on which isoform is present. Accession No. Fold GeneName L13720 ˜23.1 growth arrest-specific protein (gas6) D79990 10.2KIAA0168 Ras association (RalGDS/AF-6) domain family 2 (RASSF2) U01120˜9.8 glucose-6-phosphatase U37546 ˜7.2 IAP homolog C (MIHC) D87953 6.8RTP, DRG1, CAP43 M76180 ˜6.5 aromatic amino acid decarboxylase (ddc)M77140 ˜6 pro-galanin D50840 ˜5.6 ceramide glucosyltransferase HG2743-˜5.1 Caldesmon 1 Non-Muscle HT2846 U76421 ˜4.7 dsRNA adenosine deaminaseDRADA2b U40572 4.6 beta2-syntrophin (SNT B2) S69189 ˜4.5 peroxisomalacyl-coenzyme A oxidase U44754 4.4 PSE-binding factor PTF gamma subunitU02081 4.1 guanine nucleotide regulatory protein (NET1) oncogene D16227˜4 BDP-1 (member of the recoverin family) D17793 ˜4 3-alphahydroxysteroid dehydrogenase type IIb U83461 3.7 putative copper uptakeprotein (hCTR2) M23254 3.6 Ca2+-activated neutral protease (CANP) D150503.6 transcription factor AREB6 HG2167- ˜3.5 Protein Kinase Ht31,Camp-Dependent HT2237 D10040 3.5 long-chain acyl-CoA synthetase D318873.5 KIAA0062 gene X60673 3.4 adenylate kinase 3 U45878 ˜3.3 inhibitor ofapoptosis protein 1 L09229 3.3 long-chain acyl-coenzyme A synthetase(FACL1) U09646 3.2 carnitine palmitoyltransferase II precursor (CPT1)D31716 3.2 GC box bindig protein M37400 3.1 cytosolic aspartateaminotransferase X59834 3.1 glutamine synthase D78335 3.1 uridinemonophosphate kinase (UMPK) U41387 3 RNA helicase II/Gu) U07919 3aldehyde dehydrogenase 6 M69013 2.9 guanine nucleotide-bindingregulatory protein (G-y-alpha) HG2530- 2.9 Adenylyl Cyclase-AssociatedProtein 2 HT2626 U79288 2.8 clone 23682 D10704 2.6 choline kinase Y081342.6 ASM-like phosphodiesterase 3b U33632 2.6 two P-domain K+ channelTWIK-1 M21154 2.5 S-adenosylmethionine decarboxylase U77949 2.5Cdc6-related protein (HsCDC6) M95767 ˜2.5 di-N-acetylchitobiase D837812.5 KIAA0197 gene X98534 2.5 vasodilator-stimulated phosphoprotein(VASP) D80001 2.4 KIAA0179 gene L18960 2.4 protein synthesis factor(elF-4C) D23673 2.3 insulin receptor substrate-1 (IRS-1) J02888 2.3quinone oxidoreductase (NQO2) D63487 2.3 KIAA0153 gene U14603 2.3protein-tyrosine phosphatase (HU-PP-1) L41887 2.3 splicing factor,arginine/serine-rich 7 (SFRS7) M92287 2.2 cyclin D3 (CCND3) X61123 2.2BTG1 M95929 2.1 homeobox protein (PHOX1) U32944 2.1 cytoplasmic dyneinlight chain 1 (hdlc1) D79994 2.1 KIAA0172 gene (similar to ankyrin)D89377 2 MSX-2 U90878 2 LIM domain protein CLP-36 U97105 2 N2A3dihydropyrimidinase related protein-2 L40379 2 thyroid receptorinteractor (TRIP10) J05459 1.9 glutathione transferase M3 (GSTM3) L425421.8 RLIP76 (ralA binding protein 1) D42047 1.7 KIAA0089 similar toglycerol-3- phosphate dehydrogenase 1 M84349 1.7 transmembrane protein(CD59) D43950 1.6 KIAA0098 T-COMPLEX PROTEIN 1 (TCP-1- EPSILON) M157961.6 proliferating cell nuclear antigen (PCNA) U07225 ˜−4.3 P2Unucleotide receptor M27492 ˜−3.4 interleukin 1 receptor mRNA Y08682 −3.1carnitine palmitoyltransferase I type I U29091 ˜−2.9 selenium-bindingprotein (hSBP) X79683 −2.6 beta2 laminin. AB000220 −2.6 semaphorin EHG2197- ˜−2.5 Collagen, Type Vii, Alpha 1 HT2267 U65011 ˜−2.5preferentially expressed antigen of melanoma (PRAME) M18391 ˜−2.3tyrosine kinase receptor (eph) X71874 −1.9 proteasome-like subunitMECL-1 L43821 4.7 enhancer of filamentation (HEF1) L38487 2.3 estrogenreceptor-related protein (hERRa1) D25539 ˜8 KIAA0040 gene

[0192] TABLE 8 Genes encoding products previously reported to beregulated by progesterone Accession no. Gene Name Cell or tissue typeIsoform U26726 11-beta-hydroxysteroid dehydrogenase type 2 endometrialstromal cells, Both¹ dometrial cancer cells, M27436 tissue factor geneendometrium PR-B only² U42031 progesterone receptor-associated FKBP54breast cancer cells Both³ M68516 PCI gene (plasminogen activatorinhibitor) endometrial stromal cells PR-B only⁴ U43185 Stat5A breastcancer cells PR-B only⁵ X52730 phenylethanolamine n-methyltransferase(PNMT) adrenal medulla PR-B only⁶ M69043 MAD-3 encoding IkB-alphamacrophage cells and endometrium Both⁷ AF002020 Niemann-Pick C disease(NPC1) granulosa cells PR-B only⁸ D00017 lipocortin II (calpactin I)endometrial cancer cells PR-B only⁹ D25328 platelet-typephosphofructokinase breast cancer cells, intestinal epithelium, PR-Bonly¹⁰ granulosa cells M80254 cyclophilin isoform (hCyP3) liver PR-Bonly¹¹ HG4069-HT4339_s_at Monocyte Chemotactic Protein 1 endometrialcells and breast cancer cells PR-A only¹² Z50781 delta sleep inducingpeptide (related to TSC-22) breast cancer cells PR-A only¹³

[0193] TABLE 9 Genes selectively upregulated by PR-A Accession No. FoldIncrease Gene Name L43821 4.7 enhancer of filamentation (HEF1) Z231153.2 Bcl-x* Z50781 2.5 delta sleep inducing peptide (higly related toTSC-22) L38487 2.3 estrogen receptor-related protein (hERRa1)

[0194] TABLE 10 Genes selectively downregulated by PR-A Accession No.Fold Decrease Gene Name HG4069-HT4339 ˜−7.4 Monocyte Chemotactic Protein1 U44103 −2.8 small GTP binding protein Rab9

[0195] TABLE 11 Genes selectively upregulated by PR-B Accession No. FoldIncrease Gene Name L13720 ˜23.1 growth arrest-specific protein (gas6)M27436 ˜18.1 tissue factor gene D79990 10.2 KIAA0168 Ras association(RaIGDS/AF-6) domain family 2 (RASSF2) U01120 ˜9.8 glucose-6-phosphataseD25539 ˜8 KIAA0040 gene U37546 ˜7.2 IAP homolog C (MIHC) D87953 6.8 RTP,DRG1, CAP43 M76180 ˜6.5 aromatic amino acid decarboxylase (ddc) M836676.4 NF-IL6 (C/EBPbeta) M68516 ˜6.2 PCI gene (plasminogen activatorinhibitor 3) U43185 ˜6.1 Stat5A M77140 ˜6 pro-galanin D50840 ˜5.6ceramide glucosyltransferase HG2743-HT2846 ˜5.1 Caldesmon 1 Non-MuscleU76421 ˜4.7 dsRNA adenosine deaminase DRADA2b U40572 4.6beta2-syntrophin (SNT B2) S69189 ˜4.5 peroxisomal acyl-coenzyme Aoxidase U44754 4.4 PSE-binding factor PTF gamma subunit X52730 4.4phenylethanolamine n-methyltransferase (PNMT) U02081 4.1 guaninenucleotide regulatory protein (NET1) oncogene¹ D16227 ˜4 BDP-1 (memberof the recoverin family) D17793 ˜4 3-alpha hydroxysteroid dehydrogenasetype IIb U83461 3.7 putative copper uptake protein (hCTR2) M23254 3.6Ca2+-activated neutral protease (CANP) D15050 3.6 transcription factorAREB6 HG2167-HT2237 ˜3.5 Protein Kinase Ht31, Camp-Dependent D10040 3.5long-chain acyl-CoA synthetase D31887 3.5 KIAA0062 gene X60673 3.4adenylate kinase 3 U45878 ˜3.3 inhibitor of apoptosis protein 1 L092293.3 long-chain acyl-coenzyme A synthetase (FACL1) U09646 3.2 carnitinepalmitoyltransferase II precursor (CPT1) D31716 3.2 GC box bindigprotein M37400 3.1 cytosolic aspartate aminotransferase X59834 3.1glutamine synthase D78335 3.1 uridine monophosphate kinase (UMPK) U413873 RNA helicase II/Gu) U07919 3 aldehyde dehydrogenase 6 M69013 2.9guanine nucleotide-binding regulatory protein (G-y-alpha)¹ HG2530-HT26262.9 Adenylyl Cyclase-Associated Protein 2 U79288 2.8 clone 23682 D107042.6 choline kinase Y08134 2.6 ASM-like phosphodiesterase 3b U33632 2.6two P-domain K+ channel TWIK-1 M21154 2.5 S-adenosylmethioninedecarboxylase U77949 2.5 Cdc6-related protein (HsCDC6) M95767 ˜2.5di-N-acetylchitobiase D83781 2.5 KIAA0197 gene X98534 2.5vasodilator-stimulated phosphoprotein (VASP) X53586 2.5 Integrin α 6*D80001 2.4 KIAA0179 gene L18960 2.4 protein synthesis factor (elF-4C)D23673 2.3 insulin receptor substrate-1 (IRS-1) J02888 2.3 quinoneoxidoreductase (NQO2) D63487 2.3 KIAA0153 gene U14603 2.3protein-tyrosine phosphatase (HU-PP-1) L41887 2.3 splicing factor,arginine/serine-rich 7 (SFRS7) M92287 2.2 cyclin D3 (CCND3) X61123 2.2BTG1 AF002020 2.1 Niemann-Pick C disease (NPC1) M95929 2.1 homeoboxprotein (PHOX1) U32944 2.1 cytoplasmic dynein light chain 1 (hdlc1)D79994 2.1 KIAA0172 gene (similar to ankyrin) D89377 2 MSX-2 U90878 2LIM domain protein CLP-36 U97105 2 N2A3 dihydropyrimidinase relatedprotein-2 L40379 2 thyroid receptor interactor (TRIP10) D00017 1.9lipocortin II J05459 1.9 glutathione transferase M3 (GSTM3) D25328 1.9platelet-type phosphofructokinase M80254 1.9 cyclophilin isoform (hCyP3)L42542 1.8 RLIP76 (ralA binding protein 1) D42047 1.7 KIAA0089 similarto glycerol-3-phosphate dehydrogenase 1 M84349 1.7 transmembrane protein(CD59) D43950 1.6 KIAA0098 T-COMPLEX PROTEIN 1 (TCP-1-EPSILON) M157961.6 proliferating cell nuclear antigen (PCNA)

[0196] TABLE 12 Genes selectively downregulated by PR-B Accession No.Fold Decrease Gene Name U07225 ˜−4.3 P2U nucleotide receptor M27492˜−3.4 interleukin 1 receptor mRNA Y08682 −3.1 carnitinepalmitoyltransferase I type I U29091 ˜−2.9 selenium-binding protein(hSBP) X79683 −2.6 beta2 laminin. AB000220 −2.6 semaphorin E¹HG2197-HT2267 ˜−2.5 Collagen, Type Vii, Alpha 1 U65011 ˜−2.5preferentially expressed antigen of melanoma (PRAME) M18391 ˜−2.3tyrosine kinase receptor (eph) X71874 −1.9 proteasome-like subunitMECL-1

[0197] TABLE 13 Genes up or downregulated by progesterone via both PR-Aand PR-B Accession No. Fold Gene Name U26726 ˜22.611-beta-hydroxysteroid dehydrogenase type 2 X51521 12.7 Ezrin* U420319.4 progesterone receptor-associated FKBP54¹ U70663 ˜7.5 zinc fingertranscription factor EZF U16799 6.1 Na, K-ATPase beta-1 subunit M690434.2 MAD-3 (IkB-alpha) X65614 3.6 calcium-binding protein S100P D869622.9 Grb10 S81914 2.6 IEX-1 = radiation-inducible immediate-early U001152.4 bcl-6 M69225 ˜−3.5 bullous pemphigoid antigen (plakin family) U90907−3.2 clone 23907 J03241 ˜−3 transforming growth factor-beta 3(TGF-beta3) M92357 −2.1 tumor necrosis factor alpha-induced protein 2(B94)

[0198] TABLE 14 Gene that is reciprocally regulated (upregulated byPR-B, downregulated by PR-A) Accession No. Fold Gene Name X53586 2.5Integrin α 6*

[0199] TABLE 15 Group of genes for which the expression level isdifferent depending on which isoform is present. Accession No. Fold GeneName L13720 ˜23.1 growth arrest-specific protein (gas6) M27436 ˜18.1tissue factor gene D79990 10.2 KIAA0168 Ras association (RaIGDS/AF-6)domain family 2 (RASSF2) U01120 ˜9.8 glucose-6-phosphatase U37546 ˜7.2IAP homolog C (MIHC) D87953 6.8 RTP, DRG1, CAP43 M76180 ˜6.5 aromaticamino acid decarboxylase (ddc) M77140 ˜6 pro-galanin D50840 ˜5.6ceramide glucosyltransferase HG2743-HT2846 ˜5.1 Caldesmon 1 Non-MuscleU76421 ˜4.7 dsRNA adenosine deaminase DRADA2b U40572 4.6beta2-syntrophin (SNT B2) S69189 ˜4.5 peroxisomal acyl-coenzyme Aoxidase U44754 4.4 PSE-binding factor PTF gamma subunit U02081 4.1guanine nucleotide regulatory protein (NET1) oncogene D16227 ˜4 BDP-1(member of the recoverin family) D17793 ˜4 3-alpha hydroxysteroiddehydrogenase type IIb U83461 3.7 putative copper uptake protein (hCTR2)M23254 3.6 Ca2+-activated neutral protease (CANP) D15050 3.6transcription factor AREB6 HG2167-HT2237 ˜3.5 Protein Kinase Ht31,Camp-Dependent D10040 3.5 long-chain acyl-CoA synthetase D31887 3.5KIAA0062 gene X60673 3.4 adenylate kinase 3 U45878 ˜3.3 inhibitor ofapoptosis protein 1 L09229 3.3 long-chain acyl-coenzyme A synthetase(FACL1) U09646 3.2 carnitine palmitoyltransferase II precursor (CPT1)D31716 3.2 GC box bindig protein M37400 3.1 cytosolic aspartateaminotransferase X59834 3.1 glutamine synthase D78335 3.1 uridinemonophosphate kinase (UMPK) U41387 3 RNA helicase II/Gu) U07919 3aldehyde dehydrogenase 6 M69013 2.9 guanine nucleotide-bindingregulatory protein (G-y-alpha) HG2530-HT2626 2.9 AdenylylCyclase-Associated Protein 2 U79288 2.8 clone 23682 D10704 2.6 cholinekinase Y08134 2.6 ASM-like phosphodiesterase 3b U33632 2.6 two P-domainK+ channel TWIK-1 M21154 2.5 S-adenosylmethionine decarboxylase U779492.5 Cdc6-related protein (HsCDC6) M95767 ˜2.5 di-N-acetylchitobiaseD83781 2.5 KIAA0197 gene X98534 2.5 vasodilator-stimulatedphosphoprotein (VASP) D80001 2.4 KIAA0179 gene L18960 2.4 proteinsynthesis factor (elF-4C) D23673 2.3 insulin receptor substrate-1(IRS-1) J02888 2.3 quinone oxidoreductase (NQO2) D63487 2.3 KIAA0153gene U14603 2.3 protein-tyrosine phosphatase (HU-PP-1) L41887 2.3splicing factor, arginine/serine-rich 7 (SFRS7) M92287 2.2 cyclin D3(CCND3) X61123 2.2 BTG1 M95929 2.1 homeobox protein (PHOX1) U32944 2.1cytoplasmic dynein light chain 1 (hdlc1) D79994 2.1 KIAA0172 gene(similar to ankyrin) D89377 2 MSX-2 U90878 2 LIM domain protein CLP-36U97105 2 N2A3 dihydropyrimidinase related protein-2 L40379 2 thyroidreceptor interactor (TRIP10) J05459 1.9 glutathione transferase M3(GSTM3) L42542 1.8 RLIP76 (ralA binding protein 1) D42047 1.7 KIAA0089similar to glycerol-3-phosphate dehydrogenase 1 M84349 1.7 transmembraneprotein (CD59) D43950 1.6 KIAA0098 T-COMPLEX PROTEIN 1 (TCP-1-EPSILON)M15796 1.6 proliferating cell nuclear antigen (PCNA) U07225 ˜−4.3 P2Unucleotide receptor M27492 ˜−3.4 interleukin 1 receptor mRNA Y08682 −3.1carnitine palmitoyltransferase I type I U29091 ˜−2.9 selenium-bindingprotein (hSBP) X79683 −2.6 beta2 laminin. AB000220 −2.6 semaphorin EHG2197-HT2267 ˜−2.5 Collagen, Type Vii, Alpha 1 U65011 ˜−2.5preferentially expressed antigen of melanoma (PRAME) M18391 ˜−2.3tyrosine kinase receptor (eph) X71874 −1.9 proteasome-like subunitMECL-1 L43821 4.7 enhancer of filamentation (HEF1) L38487 2.3 estrogenreceptor-related protein (hERRa 1) D25539 ˜8 KIAA0040 gene HG4069-HT4339˜−7.4 Monocyte Chemotactic Protein 1

[0200] TABLE 16 Genes encoding products involved in breast cancer ormammary gland development*. Accession no. Fold Gene Name L13720 ˜23.1growth arrest-specific protein (gas6) M27436 ˜18.1 tissue factor geneM83667 6.4 NF-IL6-beta (C/EBPbeta)* M68516 ˜6.2 PCI gene (plasminogenactivator inhibitor) U43185 ˜6.1 Stat5A* X65614 3.6 calcium-bindingprotein S100P X53586 2.5 Integrin α 6* D89377 2 MSX-2* D00017 1.9lipocortin II (calpactin I) U29091 ˜−2.9 selenium-binding protein (hSBP)M69225 ˜−3.5 bullous pemphigoid antigen (plakin family)

REFERENCES

[0201] 1. Goruppi et al., Mol Cell Biol, 21:902-915 (2001)

[0202] 2. Ueno et al., Br J Cancer, 83:164-70 (2000); Lwaleed et al., JPathol, 187:291-4 (1999); Lwaleed et al., J Pathol, 188(1):3-8 (1999)

[0203] 3. Seagroves et al., Mol Endocrinol, 14(3):359-68 (2000);Robinson et al., Genes Dev, 12(12):1907-16 (1998); Seagroves et al.,Genes Dev, 12(12):1917-1928 (1998)

[0204] 4. Nelson et al., J Natl Cancer Inst, 92(11):866-8 (2000)

[0205] 5. Liu et al., Genes Dev, 11(2):179-86 (1997); Watson et al., BrJ Cancer, 71(4):840-844 (1995)

[0206] 6. Guerreiro de Silva et al., Int J Oncol, 16:231-40 (2000)

[0207] 7. Wewer et al., Am J Pathol, 151(5):1191-8 (1997); Tagliabue etal., Eur J Cancer, 34(12):1982-3 (1998)

[0208] 8. Phippard et al., Development, 122(9):2729-37 (1996); Friedmannet al., Dev Biol, 177(1):347-55 (1996)

[0209] 9. Mai et al., Biochim Biophys Acta, 1477(1-2):215-30 (2000)

[0210] 10. Vinceti et al., Tumori 86(2):105-18 (2000); Jiang et al., MolCarcinog, 26(4):213-25 (1999)

[0211] 11. Nacht et al., Cancer Res, 59:5464-70 (1999) TABLE 17 Genesregulated by progesterone organized by primary function of gene product.Accession no. Fold Gene Name Regulation Pattern Transcription factorsU70663 ˜7.5 zinc finger transcription factor EZF Up by Both M83667 6.4NF-IL6 (C/EBPbeta) Up by PR-B U43185 ˜6.1 Stat5A Up by PR-B D15050 3.6transcription factor AREB6 Up by PR-B D31716 3.2 GC box bindig proteinUp by PR-B U00115 2.4 bcl-6 Up by Both U44754 4.4 PSE-binding factor PTFgamma subunit Up by PR-B M95929 2.1 homeobox protein (PHOX1) Up by PR-BS81914 2.6 IEX-1 = radiation-inducible DIF2 Up by Both D89377 2 MSX-2 Upby PR-B Z50781 2.5 delta sleep inducing peptide (higly related toTSC-22) Up by PR-A L38487 2.3 estrogen receptor-related protein (hERRa1)Up by PR-A Cell adhesion or cytoskeleton interaction HG2743-HT2846 ˜5.1Caldesmon 1 Non-Muscle Up by PR-B L43821 4.7 enhancer of filamentation(HEF1) Up by PR-A U40572 4.6 beta2-syntrophin (SNT B2) Up by PR-B X985342.5 vasodilator-stimulated phosphoprotein (VASP) Up by PR-B U32944 2.1cytoplasmic dynein light chain 1 (hdlc1) Up by PR-B U90878 2 LIM domainprotein CLP-36 Up by PR-B X79683 −2.6 beta2 laminin. Down by PR-B L438214.7 enhancer of filamentation (HEF1) Up by PR-A Calcium binding proteinsD16227 ˜4 BDP-1 (member of the recoverin family) Up by PR-B X65614 3.6calcium-binding protein S100P Up by Both D00017 1.9 lipocortin II(calpactin I) Up by PR-B Cholesterol or steroid metabolism andtrafficking U26726 ˜22.6 11-beta-hydroxysteroid dehydrogenase type 2 Upby Both D17793 ˜4 3-alpha hydroxysteroid dehydrogenase type IIb Up byPR-B AF002020 2.1 Niemann-Pick C disease (NPC1) Up by PR-B Fatty acid/lipid metabolism M76180 ˜6.5 aromatic amino acid decarboxylase (ddc) Upby PR-B D50840 ˜5.6 ceramide glucosyltransferase (phospholipidsynthesis) Up by PR-B S69189 ˜4.5 peroxisomal acyl-coenzyme A oxidase Upby PR-B X52730 4.4 phenylethanolamine n-methyltransferase (PNMT) Up byPR-B L09229 3.3 long-chain acyl-coenzyme A synthetase (FACL1) Up by PR-BU09646 3.2 carnitine palmitoyltransferase II precursor (CPT1) Up by PR-BX59834 3.1 glutamine synthase Up by PR-B D78335 3.1 uridinemonophosphate kinase (UMPK) Up by PR-B Y08134 2.6 ASM-likephosphodiesterase 3b Up by PR-B J02888 2.3 quinone oxidoreductase (NQO2)Up by PR-B Y08682 −3.1 carnitine palmitoyltransferase I type I down byPR-B Nucleotide or amino acid metabolism M37400 3.1 cytosolic aspartateaminotransferase (amino acid metabolism) Up by PR-B U97105 2 N2A3dihydropyrimidinase related protein-2 Up by PR-B U07225 ˜−4.3 P2Unucleotide receptor down by PR-B General metabolic/ synthetic U01120˜9.8 glucose-6-phosphatase (gluconeogenesis) Up by PR-B U07919 3aldehyde dehydrogenase 6 (alcohol metabolism) Up by PR-B M21154 2.5S-adenosylmethionine decarboxylase (polyamine Up by PR-B biosynthesis)M95767 ˜2.5 di-N-acetylchitobiase (glycoprotein synthesis) Up by PR-BD42047 1.7 KIAA0089 gene (similar to glycerol-3-phosphate Up by PR-Bdehydrogenase 1) J05459 1.9 glutathione transferase M3 (GSTM3) Up byPR-B D25328 1.9 platelet-type phosphofructokinase Up by PR-B U29091˜−2.9 selenium-binding protein (hSBP) down by PR-B DNA-replication/transcription/ translation and protein processing U76421 ˜4.7 dsRNAadenosine deaminase DRADA2b Up by PR-B U41387 3 RNA helicase II/Gu Up byPR-B L18960 2.4 protein synthesis factor (elF-4C) Up by PR-B L41887 2.3splicing factor, arginine/serine-rich 7 (SFRS7) Up by PR-B U77949 2.5Cdc6-related protein (HsCDC6) Up by PR-B X71874 −1.9 proteasome-likesubunit MECL-1 Down by PR-B Secreted molecules L13720 ˜23.1 growtharrest-specific protein (gas6) Up by PR-B M27436 ˜18.1 tissue factorgene Up by PR-B M68516 ˜6.2 PCI gene (plasminogen activator inhibitor 3)Up by PR-B M77140 ˜6 pro-galanin Up by PR-B M23254 3.6 Ca2+-activatedneutral protease (CANP) Up by PR-B AB000220 −2.6 semaphorin E Down byPR-B Signal transduction D79990 10.2 KIAA0168 Ras association(RaIGDS/AF-6) domain family 2 (RASSF2) M69043 4.2 MAD-3 encodingIkB-alpha Up by Both U02081 4.1 guanine nucleotide regulatory protein(NET1) oncogene Up by PR-B HG2167-HT2237 ˜3.5 Protein Kinase Ht31,cAMP-Dependent Up by PR-B X60673 3.4 adenylate kinase 3 Up by PR-BHG2530-HT2626 2.9 Adenylyl Cyclase-Associated Protein 2 Up by PR-BD86962 2.9 Grb10 Up by Both M69013 2.9 guanine nucleotide-bindingregulatory protein (G-y-alpha) Up by PR-B D10704 2.6 choline kinase Upby PR-B U14603 2.3 protein-tyrosine phosphatase (HU-PP-1) Up by PR-BL40379 2 thyroid receptor interactor (TRIP10) Up by PR-B M18391 ˜−2.3tyrosine kinase receptor (eph) Down by PR-B U44103_at −2.8 small GTPbinding protein Rab9 Down by PR-A Cytokines/ Cytokine Receptors andChemokines M27492 ˜−3.4 interleukin 1 receptor mRNA Down by PR-B J03241˜−3 transforming growth factor-beta 3 (TGF-beta3) Down by BothHG4069-HT4339_s_at ˜−7.4 Monocyte Chemotactic Protein 1 Down by PR-AMembrane bound molecules U16799 6.1 Na,K-ATPase beta-1 subunit Up byBoth U83461 3.7 putative copper uptake protein (hCTR2) Up by PR-B U336322.6 two P-domain K+ channel TWIK-1 Up by PR-B M84349 1.7 transmembraneprotein (CD59) Up by PR-B M69225 ˜−3.5 bullous pemphigoid antigen(plakin family) Down by Both U65011 ˜−2.5 preferentially expressedantigen of melanoma (PRAME) Down by PR-B Chaperones/ Protein foldingU42031 9.4 progesterone receptor-associated FKBP54 Up by Both M80254 1.9cyclophilin isoform (hCyP3) Up by PR-B Apoptosis U37546 ˜7.2 IAP homologC (bindsTNFreceptor-associated factors) Up by PR-B U45878 ˜3.3 inhibitorof apoptosis protein 1 mRNA Up by PR-B Cell cycle D87953 6.8 RTP Up byPR-B M92287 2.2 cyclin D3 (CCND3) Up by PR-B M15796 1.6 proliferatingcell nuclear antigen (PCNA) Up by PR-B X61123 2.2 BTG1 Up by PR-BUnknown Function D25539 ˜8 KIAA0040 gene Up by PR-B D31887 3.5 KIAA0062gene Up by PR-B U79288 2.8 clone 23682 Up by PR-B D83781 2.5 KIAA0197gene Up by PR-B D80001 2.4 KIAA0179 gene Up by PR-B D63487 2.3 KIAA0153gene Up by PR-B D79994 2.1 KIAA0172 gene (similar to ankyrin) Up by PR-BM92357 −2.1 tumor necrosis factor, alpha-induced protein 2 B94 Down byPR-B U90907 −2.1 clone 23907 (similar to mouse p55PIK) Down by Both

[0212] TABLE 18 Transcripts regulated in T47D-YB cells after 6 hrsprogesterone treatment Accession no. Gene Name Fold Increase L13720˜23.1 growth arrest-specific protein (gas6) U26726 ˜22.611-beta-hydroxysteroid dehydrogenase type 2 M27436 ˜18.1 tissue factorgene D79990 10.2 KIAA0168 Ras association (RalGDS/AF-6) domain family 2(RASSF2) U01120 ˜9.8 glucose-6-phosphatase U42031 9.4 progesteronereceptor-associated FKBP54* D25539 ˜8 KIAA0040 gene U70663 ˜7.5 zincfinger transcription factor EZF U37546 ˜7.2 IAP homolog C (MIHC) D879536.8 RTP, DRG1, CAP43 M76180 ˜6.5 aromatic amino acid decarboxylase (ddc)M83667 6.4 NF-IL6 (C/EBPbeta) M68516 ˜6.2 PCI gene (plasminogenactivator inhibitor 3) U43185 ˜6.1 Stat5A U16799 6.1 Na, K-ATPase beta-1subunit M77140 ˜6 pro-galanin D50840 ˜5.6 ceramide glucosyltransferaseHG2743-HT2846 ˜5.1 Caldesmon 1 Non-Muscle U76421 ˜4.7 dsRNA adenosinedeaminase DRADA2b U40572 4.6 beta2-syntrophin (SNT B2) S69189 ˜4.5peroxisomal acyl-coenzyme A oxidase U44754 4.4 PSE-binding factor PTFgamma subunit X52730 4.4 phenylethanolamine n-methyltransferase (PNMT)M69043 4.2 MAD-3 (lkB-alpha) U02081 4.1 guanine nucleotide regulatoryprotein (NET1) oncogene* D16227 ˜4 BDP-1 (member of the recoverinfamily) D17793 ˜4 3-alpha hydroxysteroid dehydrogenase type IIb U834613.7 putative copper uptake protein (hCTR2) X65614 3.6 calcium-bindingprotein S100P M23254 3.6 Ca2+-activated neutral protease (CANP) D150503.6 transcription factor AREB6 HG2167-HT2237 ˜3.5 Protein Kinase Ht31,Camp-Dependent D10040 3.5 long-chain acyl-CoA synthetase D31887 3.5KIAA0062 gene X60673 3.4 adenylate kinase 3 U45878 ˜3.3 inhibitor ofapoptosis protein 1 L09229 3.3 long-chain acyl-coenzyme A synthetase(FACL1) U09646 3.2 carnitine palmitoyltransferase II precursor (CPT1)D31716 3.2 GC box bindig protein M37400 3.1 cytosolic aspartateaminotransferase X59834 3.1 glutamine synthase D78335 3.1 uridinemonophosphate kinase (UMPK) U41387 3 RNA helicase II/Gu) U07919 3aldehyde dehydrogenase 6 D86962 2.9 Grb10 M69013 2.9 guaninenucleotide-binding regulatory protein (G-y-alpha)* HG2530-HT2626 2.9Adenylyl Cyclase-Associated Protein 2 U79288 2.8 clone 23682 D10704 2.6choline kinase Y08134 2.6 ASM-like phosphodiesterase 3b U33632 2.6 twoP-domain K+ channel TWIK-1 S81914 2.6 IEX-1 = radiation-inducibleimmediate-early M21154 2.5 S-adenosylmethionine decarboxylase U77949 2.5Cdc6-related protein (HsCDC6) M95767 ˜2.5 di-N-acetylchitobiase D837812.5 KIAA0197 gene X98534 2.5 vasodilator-stimulated phosphoprotein(VASP) D80001 2.4 KIAA0179 gene L18960 2.4 protein synthesis factor(elF-4C) U00115 2.4 bcl-6 J02888 2.3 quinone oxidoreductase (NQO2)D63487 2.3 KIAA0153 gene U14603 2.3 protein-tyrosine phosphatase(HU-PP-1) L41887 2.3 splicing factor, arginine/serine-rich 7 (SFRS7)M92287 2.2 cyclin D3 (CCND3) X61123 2.2 BTG1 AF002020 2.1 Niemann-Pick Cdisease (NPC1) M95929 2.1 homeobox protein (PHOX1) U32944 2.1cytoplasmic dynein light chain 1 (hdlc1) D79994 2.1 KIAA0172 gene(similar to ankyrin) D89377 2 MSX-2 U90878 2 LIM domain protein CLP-36U97105 2 N2A3 dihydropyrimidinase related protein-2 L40379 2 thyroidreceptor interactor (TRIP10) D00017 1.9 lipocortin II J05459 1.9glutathione transferase M3 (GSTM3) D25328 1.9 platelet-typephosphofructokinase M80254 1.9 cyclophilin isoform (hCyP3) L42542 1.8RLIP76 (ralA binding protein 1) D42047 1.7 KIAA0089 similar toglycerol-3-phosphate dehydrogenase 1 M84349 1.7 transmembrane protein(CD59) D43950 1.6 KIAA0098 T-COMPLEX PROTEIN 1 (TCP-1-EPSILON) M157961.6 proliferating cell nuclear antigen (PCNA) Fold Decrease U07225 ˜−4.3P2U nucleotide receptor M69225 ˜−3.5 bullous pemphigoid antigen (plakinfamily) M27492 ˜−3.4 interleukin 1 receptor mRNA U90907 −3.2 clone 23907Y08682 −3.1 carnitine palmitoyltransferase I type I J03241 ˜−3transforming growth factor-beta 3 (TGF-beta3) U29091 ˜−2.9selenium-binding protein (hSBP) X79683 −2.6 beta2 laminin. AB000220 −2.6semaphorin E* HG2197-HT2267 ˜−2.5 Collagen, Type Vii, Alpha 1 U65011˜−2.5 preferentially expressed antigen of melanoma (PRAME) M18391 ˜−2.3tyrosine kinase receptor (eph) M92357 −2.1 tumor necrosis factor,alpha-induced protein 2 B94 X71874 −1.9 proteasome-like subunit MECL-1

[0213] TABLE 19 Transcripts regulated in T47D-YA cells after 6 hrsprogesterone treatment Accession no. Gene Name Fold Increase U26726 6.511-beta-hydroxysteroid dehydrogenase type 2 L43821 4.7 enhancer offilamentation (HEF1) U70663 ˜7.5 zinc finger transcription factor EZFU16799 3.9 Na, K-ATPase beta-1 subunit U42031 3.3 progesteronereceptor-associated FKBP54 Z50781 2.5 delta sleep inducing peptide(higly related to TSC-22) L38487 2.3 estrogen receptor-related protein(hERRa1) U00115 2.3 bcl-6 X65614 2.2 calcium-binding protein S100PS81914 2.1 IEX-1 = radiation-inducible immediate-early M69043 2.0 MAD-3mRNA (IkB-alpha) D86962 2.0 Grb10 Fold Decrease HG4069-HT4339 ˜−7.4Monocyte Chemotactic Protein 1 M69225 ˜−4.3 bullous pemphigoid antigen(BPAG1) J03241 −3.3 transforming growth factor-beta 3 (TGF-beta3) M92357−3.0 tumor necrosis factor, alpha-induced protein 2 B94 U44103 −2.8small GTP binding protein Rab9 U90907 −2.1 clone 23907

[0214] While various embodiments of the present invention have beendescribed in detail, it is apparent that modifications and adaptationsof those embodiments will occur to those skilled in the art. It is to beexpressly understood, however, that such modifications and adaptationsare within the scope of the present invention, as set forth in thefollowing claims.

1 108 1 22 DNA Artificial Sequence Primer 1 atccagcgta ctccaaagat tc 222 22 DNA Artificial Sequence Primer 2 tccttgctga aagacaagtc tg 22 3 3817DNA Homo sapiens 3 tgaattcgtg agagacttga gggaggcgct gcgactgacaagcggctctg cccgggacct 60 tctcgctttc atctagcgct gcactcaatg gaggggcgggcaccgcagtg cttaatgctg 120 tcttaactag tgtaggaaaa cggctcaacc caccgctgccgaaatgaagt ataagaatct 180 tatggcaagg gccttatatg acaatgtccc agagtgtgccgaggaactgg cctttcgcaa 240 gggagacatc ctgaccgtca tagagcagaa cacagggggactggaaggat ggtggctgtg 300 ctcgttacac ggtcggcaag gcattgtccc aggcaaccgggtgaagcttc tgattggtcc 360 catgcaggag actgcctcca gtcacgagca gcctgcctctggactgatgc agcagacctt 420 tggccaacag aagctctatc aagtgccaaa cccacaggctgctccccgag acaccatcta 480 ccaagtgcca ccttcctacc aaaatcaggg aatttaccaagtccccactg gccacggcac 540 ccaagaacaa gaggtatatc aggtgccacc atcagtgcagagaagcattg ggggaaccag 600 tgggccccac gtgggtaaaa aggtgataac ccccgtgaggacaggccatg gctacgtata 660 cgagtaccca tccagatacc aaaaggatgt ctatgatatccctccttctc ataccactca 720 aggggtatac gacatccctc cctcatcagc aaaaggccctgtgttttcag ttccagtggg 780 agagataaaa cctcaagggg tgtatgacat cccgcctacaaaaggggtat atgccattcc 840 gccctctgct tgccgggatg aagcagggct tagggaaaaagactatgact tcccccctcc 900 catgagacaa gctggaaggc cggacctcag accggagggggtttatgaca ttcctccaac 960 ctgcaccaag ccagcaggga aggaccttca tgtaaaatacaactgtgaca ttccaggagc 1020 tgcagaaccg gtggctcgaa ggcaccagag cctgtccccgaatcacccac ccccgcaact 1080 cggacagtca gtgggctctc agaacgacgc atatgatgtcccccgaggcg ttcagtttct 1140 tgagccacca gcagaaacca gtgagaaagc aaacccccaggaaagggatg gtgtttatga 1200 tgtccctctg cataacccgc cagatgctaa aggctctcgggacttggtgg atgggatcaa 1260 ccgattgtct ttctccagta caggcagcac ccggagtaacatgtccacgt cttccacctc 1320 ctccaaggag tcctcactgt cagcctcccc agctcaggacaaaaggctct tcctggatcc 1380 agacacagct attgagagac ttcagcggct ccagcaggcccttgagatgg gtgtctccag 1440 cctaatggca ctggtcacta ccgactggcg gtgttacggatatatggaaa gacacatcaa 1500 tgaaatacgc acagcagtgg acaaggtgga gctgttcctgaaggagtacc tccactttgt 1560 caagggagct gttgcaaatg ctgcctgcct cccggaactcatcctccaca acaagatgaa 1620 gcgggagctg caacgagtcg aagactccca ccagatcctgagtcaaacca gccatgactt 1680 aaatgagtgc agctggtccc tgaatatctt ggccatcaacaagccccaga acaagtgtga 1740 cgatctggac cggtttgtga tggtggcaaa gacggtgcccgatgacgcca agcagctcac 1800 cacaaccatc aacaccaacg cagaggccct cttcagacccggccctggca gcttgcatct 1860 gaagaatggg ccggagagca tcatgaactc aacggagtacccacacggtg gctcccaggg 1920 acagctgctg catcctggtg accacaaggc ccaggcccacaacaaggcac tgcccccagg 1980 cctgagcaag gagcaggccc ctgactgtag cagcagtgatggttctgaga ggagctggat 2040 ggatgactac gattacgtcc acctacaggg taaggaggagtttgagaggc aacagaaaga 2100 gctattggaa aaagagaata tcatgaaaca gaacaagatgcagctggaac atcatcagct 2160 gagccagttc cagctgttgg aacaagagat tacaaagcccgtggagaatg acatctcgaa 2220 gtggaagccc tctcagagcc tacccaccac aaacagtggcgtgagtgctc aggatcggca 2280 gttgctgtgc ttctactatg accaatgtga gacccatttcatttcccttc tcaacgccat 2340 tgacgcactc ttcagttgtg tcagctcagc ccagcccccgcgaatcttcg tggcacacag 2400 caagtttgtc atcctcagtg cacacaaact ggtgttcattggagacacgc tgacacggca 2460 ggtgactgcc caggacattc gcaacaaagt catgaactccagcaaccagc tctgcgagca 2520 gctcaagact atagtcatgg caaccaagat ggccgccctccattacccca gcaccacggc 2580 cctgcaggaa atggtgcacc aagtgacaga cctttctagaaatgcccagc tgttcaagcg 2640 ctctttgctg gagatggcaa cgttctgaga agaaaaaaaagaggaagggg actgcgttaa 2700 cggttactaa ggaaaactgg aaatactgtc tggtttttgtaaatgttatc tatttttgta 2760 gataatttta tataaaaatg aaatatttta acattttatgggtcagacaa ctttcagaaa 2820 ttcagggagc tggagaggga aatctttttt tcccccctgagtgttcttat gtatacacag 2880 aagtatctga gacataaact gtacagaaaa cttgtccacgtccttttgta tgcccatgta 2940 ttcatgtttt tgtttgtaga tgtttgtctg atgcatttcattaaaaaaaa aaccatgaat 3000 tacgaagcac cttagtaagc accttctaat gctgcattttttttgttgtt gttaaaaaca 3060 tccagctggt tataatattg ttctccacgt ccttgtgatgattctgagcc tggcactggg 3120 aatctgggaa gcatagttta tttgcaagtg ttcaccttccaaatcatgag gcatagcatg 3180 acttattctt gttttgaaaa ctcttttcaa aactgaccatcttaaacaca tgatggccaa 3240 gtgccacaaa gccctcttgc ggagacattt acgaatatatatgtggatcc aagtctcgat 3300 agttaggcgt tggagggaag agagaccaga gagtttagaggccaggacca cagttaggat 3360 tgggttgttt caatactgag agacagctac aataaaaggagagcaattgc ctccctgggg 3420 ctgttcaatc ttctgcattt gtgagtggtt cagtcatgaggttttccaaa agatgttttt 3480 agagttgtaa aaaccatatt tgcagcaaag atttacaaaggcgtatcaga ctatgattgt 3540 tcaccaaaat aggggaatgg tttgatccgc cagttgcaagtagaggcctt tctgactctt 3600 aatattcact ttggtgctac tacccccatt acctgaggaactggccaggt ccttgatcat 3660 ggaactatag agctaccaga catatcctgc tctctaagggaatttattgc tatcttgcac 3720 cttctttaaa actcaaaaaa catatgcaga cctgacactcaagagtggct agctacacag 3780 agtccatcta atttttgcaa cttccccccc cgaattc 38174 2218 DNA Homo sapiens 4 tcctacaagc agccggcggc gccgccgagt gaggggacgcggcgcggtgg ggcggcgcgg 60 cccgaggagg cggcggagga ggggccgccc gcggcccccggctcactccg gcactccggg 120 ccgctcggcc cccatgcctg cccgaccgcg ctgccggagccccaggtgac cagcgccatg 180 tccagccagg tggtgggcat tgagcctctc tacatcaaggcagagccggc cagccctgac 240 agtccaaagg gttcctcgga gacagagacc gagcctcctgtggccctggc ccctggtcca 300 gctcccactc gctgcctccc aggccacaag gaagaggaggatggggaggg ggctgggcct 360 ggcgagcagg gcggtgggaa gctggtgctc agctccctgcccaagcgcct ctgcctggtc 420 tgtggggacg tggcctccgg ctaccactat ggtgtggcatcctgtgaggc ctgcaaagcc 480 ttcttcaaga ggaccatcca ggggagcatc gagtacagctgtccggcctc caacgagtgt 540 gagatcacca agcggagacg caaggcctgc caggcctgccgcttcaccaa gtgcctgcgg 600 gtgggcatgc tcaaggaggg agtgcgcctg gaccgcgtccggggtgggcg gcagaagtac 660 aagcggcggc cggaggtgga cccactgccc ttcccgggccccttccctgc tgggcccctg 720 gcagtcgctg gaggcccccg gaagacagcc ccagtgaatgcactggtgtc tcatctgctg 780 gtggttgagc ctgagaagct ctatgccatg cctgaccccgcaggccctga tgggcacctc 840 ccagccgtgg ctaccctctg tgacctcttt gaccgagagattgtggtcac catcagctgg 900 gccaagagca tcccaggctt ctcatcgctg tcgctgtctgaccagatgtc agtactgcag 960 agcgtgtgga tggaggtgct ggtgctgggt gtggcccagcgctcactgcc actgcaggat 1020 gagctggcct tcgctgagga cttagtcctg gatgaagagggggcacgggc agctggcctg 1080 ggggaactgg gggctgccct gctgcaacta gtgcggcggctgcaggccct gcggctggag 1140 cgagaggagt atgttctact aaaggccttg gcccttgccaattcagactc tgtgcacatc 1200 gaagatgccg aggctgtgga gcagctgcga gaagctctgcacgaggccct gctggagtat 1260 gaagccggcc gggctggccc cggagggggt gctgagcggcggcgggcggg caggctgctg 1320 ctcacgctac cgctcctccg ccagacagcg ggcaaagtgctggcccattt ctatggggtg 1380 aagctggagg gcaaggtgcc catgcacaag ctgttcttggagatgctcga ggccatgatg 1440 gactgaggca aggggtggga ctggtggggg ttctggcaggacctgcctag catggggtca 1500 gccccaaggg ctggggcgga gctggggtct gggcagtgccacagcctgct ggcagggcca 1560 gggcaatgcc atcagcccct gggaacaggc cccacgccctctcctccccc tcctaggggg 1620 tgtcagaagc tgggaacgtg tgtccaggct ctgggcacagtgctgcccct tgcaagccat 1680 aacgtgcccc cagagtgtag ggggccttgc ggaagccatagggggctgca cgggatgcgt 1740 gggaggcaga aacctatctc agggagggaa ggggatggaggccagagtct cccagtgggt 1800 gatgcttttg ctgctgctta atcctacccc ctcttcaaagcagagtggga cttggagagc 1860 aaaggcccat gcccccttcg ctcctcctct catcatttgcattgggcatt agtgtccccc 1920 cttgaagcaa taactccaag cagactccag cccctggacccctggggtgg ccagggcttc 1980 cccatcagct cccaacgagc ctcctcaggg ggtaggagagcactgcctct atgccctgca 2040 gagcaataac actatattta tttttgggtt tggccagggaggcgcaggga catggggcaa 2100 gccagggccc agagcccttg gctgtacaga gactctattttaatgtatat ttgctgcaaa 2160 gagaaaccgc ttttggtttt aaacctttaa tgagaaaaaaatatataata ccgagctc 2218 5 606 DNA Homo sapiens 5 atggcaggaa aatcttcactttttaaagta attctccttg gagatggtgg agttgggaag 60 agttcactta tgaacagatatgtaactaat aagtttgata cccagctctt ccatacaata 120 ggtgtggaat ttttaaataaagatttggaa gtggatggac attttgttac catgcagatt 180 tgggacacgg caggtcaggagcgattccga agcctgagga caccatttta cagaggttct 240 gactgctgcc tgcttacttttagtgtcgat gattcacaaa gcttccagaa cttaagtaac 300 tggaagaaag aattcatatattatgcagat gtgaaagagc ctgagagctt tccttttgtg 360 attctgggta acaagattgacataagcgaa cggcaggtgt ctacagaaga agcccaagct 420 tggtgcaggg acaacggcgactatccttat tttgaaacaa gtgcaaaaga tgccacaaat 480 gtggcagcag cctttgaggaagcggttcga agagttcttg ctaccgagga taggtcagat 540 catttgattc agacagacacagtcaatctt caccgaaagc ccaagcctag ctcatcttgc 600 tgttga 606 6 2461 DNAHomo sapiens 6 ccgcagccgc cgccgccgcc gccgccgcga tgtgaccttc agggccgccaggacgggatg 60 accggagcct ccgccccgcg gcgcccgctc gcctcggcct cccgggcgctctgaccgcgc 120 gtccccggcc cgccatggcc ccttcgctct cgcccgggcc cgccgccctgcgccgcgcgc 180 cgcagctgct gctgctgctg ctggccgcgg agtgcgcgct tgccgcgctgttgccggcgc 240 gcgaggccac gcagttcctg cggcccaggc agcgccgcgc ctttcaggtcttcgaggagg 300 ccaagcaggg ccacctggag agggagtgcg tggaggagct gtgcagccgcgaggaggcgc 360 gggaggtgtt cgagaacgac cccgagacgg attattttta cccaagatacttagactgca 420 tcaacaagta tgggtctccg tacaccaaaa actcaggctt cgccacctgcgtgcaaaacc 480 tgcctgacca gtgcacgccc aacccctgcg ataggaaggg gacccaagcctgccaggacc 540 tcatgggcaa cttcttctgc ctgtgtaaag ctggctgggg gggccggctctgcgacaaag 600 atgtcaacga atgcagccag gagaacgggg gctgcctcca gatctgccacaacaagccgg 660 gtagcttcca ctgttcctgc cacagcggct tcgagctctc ctctgatggcaggacctgcc 720 aagacataga cgagtgcgca gactcggagg cctgcgggga ggcgcgctgcaagaacctgc 780 ccggctccta ctcctgcctc tgtgacgagg gctttgcgta cagctcccaggagaaggctt 840 gccgagatgt ggacgagtgt ctgcagggcc gctgtgagca ggtctgcgtgaactccccag 900 ggagctacac ctgccactgt gacgggcgtg ggggcctcaa gctgtcccaggacatggaca 960 cctgtgagga catcttgccg tgcgtgccct tcagcgtggc caagagtgtgaagtccttgt 1020 acctgggccg gatgttcagt gggacccccg tgatccgact gcgcttcaagaggctgcagc 1080 ccaccaggct ggtagctgag tttgacttcc ggacctttga ccccgagggcatcctcctct 1140 ttgccggagg ccaccaggac agcacctgga tcgtgctggc cctgagagccggccggctgg 1200 agctgcagct gcgctacaac ggtgtcggcc gtgtcaccag cagcggcccggtcatcaacc 1260 atggcatgtg gcagacaatc tctgttgagg agctggcgcg gaatctggtcatcaaggtca 1320 acagggatgc tgtcatgaaa atcgcggtgg ccggggactt gttccaaccggagcgaggac 1380 tgtatcatct gaacctgacc gtgggaggta ttcccttcca tgagaaggacctcgtgcagc 1440 ctataaaccc tcgtctggat ggctgcatga ggagctggaa ctggctgaacggagaagaca 1500 ccaccatcca ggaaacggtg aaagtgaaca cgaggatgca gtgcttctcggtgacggaga 1560 gaggctcttt ctaccccggg agcggcttcg ccttctacag cctggactacatgcggaccc 1620 ctctggacgt cgggactgaa tcaacctggg aagtagaagt cgtggctcacatccgcccag 1680 ccgcagacac aggcgtgctg tttgcgctct gggcccccga cctccgtgccgtgcctctct 1740 ctgtggcact ggtagactat cactccacga agaaactcaa gaagcagctggtggtcctgg 1800 ccgtggagca tacggccttg gccctaatgg agatcaaggt ctgcgacggccaagagcacg 1860 tggtcaccgt ctcgctgagg gacggtgagg ccaccctgga ggtggacggcaccaggggcc 1920 agagcgaggt gagcgccgcg cagctgcagg agaggctggc cgtgctcgagaggcacctgc 1980 ggagccccgt gctcaccttt gctggcggcc tgccagatgt gccggtgacttcagcgccag 2040 tcaccgcgtt ctaccgcggc tgcatgacac tggaggtcaa ccggaggctgctggacctgg 2100 acgaggcggc gtacaagcac agcgacatca cggcccactc ctgcccccccgtggagcccg 2160 ccgcagccta ggcccccacg ggacgcggca ggcttctcag tctctgtccgagacagccgg 2220 gaggagcctg ggggctcctc accacgtggg gccatgctga gagctgggctttcctctgtg 2280 accatcccgg cctgtaacat atctgtaaat agtgagatgg acttggggcctctgacgccg 2340 cgcactcagc cgtgggcccg ggcgcgggga ggccggcgca gcgcagagcgggctcgaaga 2400 aaataattct ctattatttt tattaccaag cgcttctttc tgactctaaaatatggaaaa 2460 t 2461 7 2127 DNA Homo sapiens 7 ctcgcactcc ctctggccggcccagggcgc cttcagccca acctccccag ccccacgggc 60 gccacggaac ccgctcgatctcgccgccaa ctggtagaca tggagacccc tgcctggccc 120 cgggtcccgc gccccgagaccgccgtcgct cggacgctcc tgctcggctg ggtcttcgcc 180 caggtggccg gcgcttcaggcactacaaat actgtggcag catataattt aacttggaaa 240 tcaactaatt tcaagacaattttggagtgg gaacccaaac ccgtcaatca agtctacact 300 gttcaaataa gcactaagtcaggagattgg aaaagcaaat gcttttacac aacagacaca 360 gagtgtgacc tcaccgacgagattgtgaag gatgtgaagc agacgtactt ggcacgggtc 420 ttctcctacc cggcagggaatgtggagagc accggttctg ctggggagcc tctgtatgag 480 aactccccag agttcacaccttacctggag acaaacctcg gacagccaac aattcagagt 540 tttgaacagg tgggaacaaaagtgaatgtg accgtagaag atgaacggac tttagtcaga 600 aggaacaaca ctttcctaagcctccgggat gtttttggca aggacttaat ttatacactt 660 tattattgga aatcttcaagttcaggaaag aaaacagcca aaacaaacac taatgagttt 720 ttgattgatg tggataaaggagaaaactac tgtttcagtg ttcaagcagt gattccctcc 780 cgaacagtta accggaagagtacagacagc ccggtagagt gtatgggcca ggagaaaggg 840 gaattcagag aaatattctacatcattgga gctgtggtat ttgtggtcat catccttgtc 900 atcatcctgg ctatatctctacacaagtgt agaaaggcag gagtggggca gagctggaag 960 gagaactccc cactgaatgtttcataaagg aagcactgtt ggagctactg caaatgctat 1020 attgcactgt gaccgagaacttttaagagg atagaataca tggaaacgca aatgagtatt 1080 tcggagcatg aagaccctggagttcaaaaa actcttgata tgacctgtta ttaccattag 1140 cattctggtt ttgacatcagcattagtcac tttgaaatgt aacgaatggt actacaacca 1200 attccaagtt ttaatttttaacaccatggc accttttgca cataacatgc tttagattat 1260 atattccgca ctcaaggagtaaccaggtcg tccaagcaaa aacaaatggg aaaatgtctt 1320 aaaaaatcct gggtggacttttgaaaagct tttttttttt tttttttttg agacggagtc 1380 ttgctctgtt gcccaggctggagtgcagta gcacgatctc ggctcactgc accctccgtc 1440 tctcgggttc aagcaattgtctgcctcagc ctcccgagta gctgggatta caggtgcgca 1500 ctaccacacc aagctaatttttgtattttt tagtagagat ggggtttcac catcttggcc 1560 aggctggtct tgaattcctgacctcagttg atccacccac cttggcctcc caaagtgcta 1620 gtattatggg cgtgaaccaccatgcccagc cgaaaagctt ttgaggggct gacttcaatc 1680 catgtaggaa agtaaaatggaaggaaattg ggtgcatttc taggactttt ctaacatatg 1740 tctataatat agtgtttaggttcttttttt tttcaggaat acatttggaa attcaaaaca 1800 attggcaaac tttgtattaatgtgttaagt gcaggagaca ttggtattct gggcaccttc 1860 ctaatatgct ttacaatctgcactttaact gacttaagtg gcattaaaca tttgagagct 1920 aactatattt ttataagactactatacaaa ctacagagtt tatgatttaa ggtacttaaa 1980 gcttctatgg ttgacattgtatatataatt ttttaaaaag gttttctata tggggatttt 2040 ctatttatgt aggtaatattgttctatttg tatatattga gataatttat ttaatatact 2100 ttaaataaag gtgactgggaattgtta 2127 8 5426 DNA Homo sapiens 8 ggggaggaag aaaggcgaag gcaaggcgaaggggtggaga gtgatatgaa gagcgagaga 60 aaagagagga cagcggacga gcagatccggtatctggaat cccggcgcct agaacgtgtt 120 tttcgggaga gcaaaggctg tgtctacggcaggctgggga tatagcctct ccttccgatg 180 aaaagagaaa ggaagaatgg actacagccaccaaacgtcc ctagtcccat gtggacaaga 240 taaatacatt tccaaaaatg aacttctcttgcatctgaag acctacaact tgtactatga 300 aggccagaat ttacagctcc ggcaccgggaggaagaagac gagttcattg tggaggggct 360 cctgaacatc tcctggggcc tgcgccggcccattcgcctg cagatgcagg atgacaacga 420 acgcattcga ccccctccat cctcctcctcctggcactct ggctgtaacc tgggggctca 480 gggaaccact ctgaagcccc tgactgtgcccaaagttcag atctcagagg tggatgcccc 540 gccggagggt gaccagatgc caagctccacagactccagg ggcctgaagc ccctgcagga 600 ggacacccca cagctgatgc gcacacgcagtgatgttggg gtgcgtcgcc gtggcaatgt 660 gaggacgcct agtgaccagc ggcgaatcagacgccaccgc ttctccatca acggccattt 720 ctacaaccat aagacatccg tgttcacaccagcctatggc tctgtcacca acgtccgcat 780 caacagcacc atgaccaccc cacaggtcctgaagctgctg ctcaacaaat ttaagattga 840 gaattcagca gaggagtttg ccttgtacgtggtccatacg agtggtgaga aacagaagct 900 gaaggccacc gattacccgc tgattgcccgaatcctccag ggcccatgtg agcagatctc 960 caaagtgttc ctaatggaga aggaccaggtggaggaagtc acctacgacg tggcccagta 1020 tataaagttc gagatgccgg tacttaaaagcttcattcag aagctccagg aggaagaaga 1080 tcgggaagta aagaagctga tgcgcaagtacaccgtgctc cggctaatga ttcgacagag 1140 gctggaggag atagccgaga ccccagcaacaatctgagcc atgagaacga ggggatctgg 1200 gcaccccagg aaccgccatt gcccataagacccccaggaa gctaggcact ttctttccat 1260 ggaaacattt agacacaaac ctccccagctccggccaagc catcatttgc tacctggagc 1320 tggatgtaga agtcagcaga cagctccctatccctggacc cctgccctcc ttttttctgc 1380 tcacaaggac ttttgatttt agttataaggaggacccaaa atgtgtgtgt gtacatgtgt 1440 gtgcacacat ggtacgtgtc catgtgcctacctgatactt tcacatgtaa ttaaattcca 1500 ggcaaccagc acaagagccg tgagcttggcacatgtgctg ctcgtgagca ggaaaatcag 1560 aggagccact gatctgagtg gtatttaggttgaaggaaag atttctcctc tcaagtgcca 1620 gggagcagcc acacgtctgt ctgtgtttagagagggaaga gggttctcca ggttcaccat 1680 ttgggttgtt tatatgttgg tagaaattctccctgtatgc ctagaaggat cagtgaatgt 1740 aagagccttg gaaattaaca aaataacagccacataacct tgcggcaagt ctgatggaaa 1800 gaaaaagata aaccatccgt ggggtagatgcaataagccc acgtattttt acactggaaa 1860 cgttgattgt tttaaatgac aaagacatatgtgatgttct atgtggaaac ctgtgaagag 1920 tggattctgc ctccatctct gcctccatggctacctttag gagacagaga agatcctgtg 1980 tgtttctctg tacccagctg acagcctgtctctatggcgc ttccttgagt ggaaggaaat 2040 gtctcaagaa acaaagatct cgctggtgcgtacacagtgc tgaccagcta gtgtggccag 2100 ggcctggtgg cctggtggcc aggaagtttcaggttgaagg gaaatgtcga ggctacctgc 2160 agatatgaca ggtgccttga acgcagcccatcttcatgtc atcaaaggtc ttcctgcact 2220 tgaagctggg gcgatgtttg cagtcaagaccattctttcc aacctctggg ttcttgcaag 2280 ttgccctcac cttgtgtgtg gagatgcattccaagaatga agcctcatct tgctactgag 2340 tgtggggttc agggaagctc tttaggccacctggtgaagg tgcatgggga ggatggagct 2400 tctcctcagc tcctctgagc agccacctatgtgatcttta aatccaaccc caatgggaga 2460 aaagggcaag aacagtctgt gccctgggactcctatcagg aagcttgaca ggcagctggg 2520 catcagtgca gctgatatcg tttgaggagggagacagatg cttggacctg ggtgcctggc 2580 tatggagatt gaccaagcaa gatcaggagctcctgatagc aggcgtcttt gagcctagct 2640 ggggtagagg cactgcccat ctcttctccaccttctctcc acagaatgtt tgcagagctg 2700 ggcagttgag gaaaggacag cccctggttggtgcctccaa aggaaggtgg acttttttgg 2760 tggagacgtt tctgccctgg gcaccctcctgcccccgatt catacctatg gcttcttgag 2820 aaggctcaca gctgtggtct taacgtagactgcagaaaga tggcatgcgg cccctggcat 2880 ttcgccaagg gttttatagc aagtctccttcctccatagg gacagcagca ccagccctgt 2940 ggggcatgga gtggaagccc agaagggcttctgcaagctg cacagaactg gggtaagaag 3000 acaaagagta gccaccggga gaggcttcctttgttacagc tgggaaagaa cagttctgtg 3060 aatgcaaaca cctcctgagt tttgcaattgagaaaatgat ttggagaact tctcttctgg 3120 taatttttat tttgaatgtt cagggccttagttggcccca gtaattctcc ttggaggact 3180 tgggagaaga atttccacaa agcaaactactaaccactag ctcttactgg acagcgattt 3240 ctggcttata agagttctct ttgatttgcactagcactac gatagtgtta gatggggaaa 3300 tactgcaaca tgtccagttg gccagatcactttccaaggg agcgatacta aggcagactc 3360 agctttttaa agatgggagg tcaggaggtggaagtgagag gagatcccat ctcacacaac 3420 acacttccac gtaatgcaga ccacacttttccattttgtc ctgccctctt gagaggtcat 3480 ttctcacgtc ctaagaacct gatcagaaattttggaaggg ttctttgaaa tagcagcagt 3540 tgaaacagag acactttgcc acagtgtggagcagattttc tcactggtat cacatggtct 3600 tgcagttttg aactcttcga ccgatttgtgggagtttatg taattgcgtg caatgaacct 3660 gaaattgtgt aaaggacaaa agaccagtttatagggttgg gttttttttc caacttgtga 3720 aaagcagttt agctgcatct gtctccccaccacccccacc ccgggagggg cttatgttac 3780 aaggtgatca agtgaaggaa aaacctgagcctatctggct gggatggtgg aattaagcac 3840 aaggtcacat tctctgtgat cacatgagagggaaggtgat gacttaaatg gcagggggtg 3900 gggattatct tggggagagg ctgaaaagcacaaaagatag tcttccctgt acgtattggt 3960 gaagaacgtg cacaaggctg gatggacttcaacttggagt tgagttgagg caagaggatt 4020 tctggatatt agtcacccat ctgcaagaaaaatgctgagg cctcgggtca agattttgat 4080 ctgagacatg ctgatgcttc aaggagaaatattttcacaa tcctctcttc cctcaccaga 4140 agagaacagt actctctcct agaaacctctaggtaaacac attttatcct aatatcggta 4200 gcatataatg ccccccccaa aatatctgttttccatgcaa aaaagtctca acaagaagtc 4260 tgtggagttg agtggttact tcaaagtgtcaggagagtga agaaattggc cacagaagag 4320 caagaagctc tcttaagaaa agggaattctctttaaagaa accaccacca acaacaaaac 4380 aaccaaaaac catgttttat gtcaaagctctgtagcacag agaatgtggt gtcacagata 4440 catcgccgag agaggtttct ttctttcttttttttttttt tgagacagag tctggttctg 4500 tttcccaggc tggagtgcag tggtgggatctcagctcact gcaacatccg cctctggggt 4560 tcaagtgatt ctcctgtctc agcctcccaagtagctggaa ttacagggac ccgccaccac 4620 gcccggctaa tttttttgtg tggttttagtagaggtgggg tttcaccatc ttggccaggc 4680 tggtcttgaa ctcctgacct cgtgatccacccgcctaggc ctcccaaagt gttgggatta 4740 caggcgtgag ccactgtgcc cagccaaaagagaaatttct acatgaacaa ggcaatttca 4800 gtgtcttaca gcggccaaac catgacgtgaagaatgagat aggagacagg agatcaccat 4860 aagcgtccct gatatagcag cacacattttcacgtttcca cttaaatcgt tttgcacaaa 4920 gtcttgcttc gctcagatga gatgagatatgatttcctag agatgtaaaa ataagaatga 4980 atgtggcgcc cccttcttcc agatgtaatagaaagctctg ccctatcaca aggggggtgt 5040 tgaagcgccc cttgtgtttt aactgtatttaactgagcac aagatgcaca agctgtggtg 5100 ggaaaccctc agtttacctt tggagtcttccctgcagatc gcagacctgt ttccaggctg 5160 atgtttctgg tgtgtaattg ctagcgtttctgaagggttt tcccaattgt tttagccttg 5220 tgaagtattc ttaattataa cttgcctttcagcgatggta catgacttga ttcaacgttt 5280 ggttctgaac ttacacactg atgcgtttactcatctaaca taatctgaca gggcctcagc 5340 aagggagcca tacatttttg taacattttgatatgtttta atgcatctga cttagatctt 5400 actgaaataa agcacttttc aaagag 54269 3095 DNA Homo sapiens 9 tagcagagca atcaccacca agcctggaat aactgcaagggctctgctga catcttcctg 60 aggtgccaag gaaatgagga tggaggaagg aatgaatgttctccatgact ttgggatcca 120 gtcaacacat tacctccagg tgaattacca agactcccaggactggttca tcttggtgtc 180 cgtgatcgca gacctcagga atgccttcta cgtcctcttccccatctggt tccatcttca 240 ggaagctgtg ggcattaaac tcctttgggt agctgtgattggagactggc tcaacctcgt 300 ctttaagtgg attctctttg gacagcgtcc atactggtgggttttggata ctgactacta 360 cagcaacact tccgtgcccc tgataaagca gttccctgtaacctgtgaga ctggaccagg 420 gagcccctct ggccatgcca tgggcacagc aggtgtatactacgtgatgg tcacatctac 480 tctttccatc tttcagggaa agataaagcc gacctacagatttcggtgct tgaatgtcat 540 tttgtggttg ggattctggg ctgtgcagct gaatgtctgtctgtcacgaa tctaccttgc 600 tgctcatttt cctcatcaag ttgttgctgg agtcctgtcaggcattgctg ttacagaaac 660 tttcagccac atccacagca tctataatgc cagcctcaagaaatattttc tcattacctt 720 cttcctgttc agcttcgcca tcggatttta tctgctgctcaagggactgg gtgtagacct 780 cctgtggact ctggagaaag cccagaggtg gtgcgagcagccagaatggg tccacattga 840 caccacaccc tttgccagcc tcctcaagaa cctgggcacgctctttggcc tggggctggc 900 tctcaactcc agcatgtaca gggagagctg caaggggaaactcagcaagt ggctcccatt 960 ccgcctcagc tctattgtag cctccctcgt cctcctgcacgtctttgact ccttgaaacc 1020 cccatcccaa gtcgagctgg tcttctacgt cttgtccttctgcaagagtg cggtagtgcc 1080 cctggcatcc gtcagtgtca tcccctactg cctcgcccaggtcctgggcc agccgcacaa 1140 gaagtcgttg taagagatgt ggagtcttcg gtgtttaaagtcaacaacca tgccagggat 1200 tgaggaggac tactatttga agcaatgggc actggtatttggagcaagtg acatgccatc 1260 cattctgccg tcgtggaatt aaatcacgga tggcagattggagggtcgcc tggcttattc 1320 ccatgtgtga ctccagcctg ccctcagcac agactctttcagatggaggt gccatatcac 1380 gtacaccata tgcaagtttc ccgccaggag gtcctcctctctctacttga atactctcac 1440 aagtagggag ctcactccca ctggaacagc ccattttatctttgaatggt cttctgccag 1500 cccattttga ggccagaggt gctgtcagct caggtggtcctcttttacaa tcctaatcat 1560 attgggtaat gtttttgaaa agctaatgaa gctattgagaaagacctgtt gctagaagtt 1620 gggttgttct ggattttccc ctgaagactt acttattcttccgtcacata tacaaaagca 1680 agacttccag gtagggccag ctcacaagcc caggctggagatcctaactg agaattttct 1740 acctgtgttc attcttaccg agaaaaggag aaaggagctctgaatctgat aggaaaagaa 1800 ggctgcctaa ggaggagttt ttagtatgtg gcgtatcatgcaagtgctat gccaagccat 1860 gtctaaatgg ctttaattat atagtaatgc actctcagtaatgggggacc agcttaagta 1920 taattaatag atggttagtg gggtaattct gcttctagtattttttttac tgtgcataca 1980 tgttcatcgt atttccttgg atttctgaat ggctgcagtgacccagatat tgcactaggt 2040 caaaacattc aggtatagct gacatctcct ctatcacattacatcatcct ccttataagc 2100 ccagctctgc tttttccaga ttcttccact ggctccacatccaccccact ggatcttcag 2160 aaggctagag ggcgactctg gtggtgcttt tgtatgtttcaattaggctc tgaaatcttg 2220 ggcaaaatga caaggggagg gccaggattc ctctctcaggtcactccagt gttactttta 2280 attcctagag ggtaaatatg actcctttct ctatcccaagccaaccaaga gcacattctt 2340 aaaggaaaag tcaacatctt ctctcttttt ttttttttttgagacagggt ctcactatgt 2400 tgcccaggct gctcttgaat tcctgggctc aagcagtcctcccaccctac cacagcgtcc 2460 cgcgtagctg gcatacaggt gcaagccact atgtccagctagccaactcc tccttgcctg 2520 cttttctttt tttttctttt tttgagacgg cgcacctatcacccaggctg gagtggagtg 2580 gcacgatctt ggctcactgc aacctcttcc tcctggttcaagcgattctc atgtctcagc 2640 ctcctcagta gctaggacta ccggcgtgca ccaccatgccaggctaattt ttatattttt 2700 agaattttag aagagatggg atttcatcat gttggccaggctggtctcga actcctgacc 2760 tcaagtgatc cacctgcctt ggcctcccaa ggtgctaggattacaggcat gagccaccgc 2820 accgggccct ccttgcctgt ttttcaatct catctgatatgcagagtatt tctgccccac 2880 ccacctaccc cccaaaaaaa gctgaagcct atttatttgaaagtccttgt ttttgctact 2940 aattatatag tataccatac attatcattc aaaacaaccatcctgctcat aacatctttg 3000 aaaagaaaaa tatatatgtg cagtatttta ttaaagcaacattttattta agaataaagt 3060 cttgttaatt actatatttt agatgcaatg tgatc 309510 4460 DNA Homo sapiens 10 cggggcagca accaggagat tccctgggcc tgcaggaagcccttccgcgg accgaaagat 60 tgttccccat tttggagatg aagaaactga gactcaaagcagctgagtga ccttcccaag 120 gacacacact gaactgggcg gtgatcagga tctgaatgcacagggcgggt gttcagcgat 180 tgtttactac gttgaacgtg acctccagga aagcagttctggccgagatc ccctgacaac 240 gcaaagcaag aagtaacgtg gaaggaggct ccccaagctggctggccatt ttgctgctgt 300 gtgtggaggt gctgtcagtg gcatgcccaa acccaaagctggaagaggaa taaattacaa 360 gtggtcaagg ttgcatcctt ttgagctcag gacctgcttgtaagccgaga gggttctctg 420 gccctaatct agccaagcac catggagaga atcagtgccttcttcagctc tatctgggac 480 accatcttga ccaaacacca agaaggcatc tacaacaccatctgcctggg agtcctcctg 540 ggcctgccac tcttggtgat catcacactc ctcttcatctgttgccattg ctgctggagc 600 ccaccaggca agaggggcca gcagccagag aagaaaaagaagaagaagaa gaagaaggat 660 gaagaagacc tctggatctc tgctcaaccc aagcttctccagatggagaa gagaccatca 720 ctgcctgttt agttaggcag gaagcagagg tgtttcctttctggggctaa gcctccttct 780 gaccacacac agacatttca ggaacccctg aaataatgcactatgtccat gtccacagag 840 taactactca accaaggaac aaacctcaga ctaagtgtcccagtggaggg cagtcccagg 900 gaccacgtgg acaattcttg gatactgtct tggcagctatgtgtccaata gcaatgctcc 960 ttactgcaga cccaggcatg cctcccacct gtctctggcataccccacat gcaaagcaca 1020 aagaacattt atccatacat ctcaatatgg ttcccaagtgtgtgcacatg cacgtaacac 1080 acacacacac aaattcaggt agcaggtacg tgggcaagtatattctgctc atcaaatggt 1140 cattggctat gtactttgtg cagggaagta cattatctacagtcacaaaa atgtctcatg 1200 ggaaagcctt gccagattca gacacatata tacaatttcctaaccagcaa ggcccccata 1260 caccatctat tccataaacc actcaggtta cagatgcatgctttcctatt tctaactcta 1320 cacataaact tttactggaa gtactcataa ttggacattccagcaacctg ctacagtccc 1380 cacccttgtg tgtcttgata cagacacacc aagtttctgtgcctctgacc cctcacctgt 1440 gccaagatgt ttaaagtgtg atggttcaaa attcattgaaagctcttttc ttgtaactca 1500 tgacaaagtc cgtcctcatt gccactgaga ggtgtttaatgtgatccaag acctctctgt 1560 gaaacattac ccccgcaaac cactcagcaa agtgcctttctccaagcaag aacaaagagc 1620 tcttggtggt gactgctaga aaattatgga agcccactcatttatgtcag tggactgcaa 1680 ctgtgtacct gtgcaatgtt tacagatgga aagggtgaggagatgctaca cctgagctag 1740 gtatctccta tataaccaaa gtttccagca gggaaggaactagacaatca tcagtgcagt 1800 ctcacagaag gcaacactgg aagtgatgtc ataaggttgtgatgtgtgca cggtacggca 1860 caggtgggat gcagaggtaa cagagtttaa atgaaagtaggatgaagcta taaagaggtt 1920 tatttatatt tatattgaag ctcaggcaag tgccttgcacacagtaggta cttataacta 1980 actgtggtta ctgttggata tgtgatgttg ttaagggtaagcttgtaata cctcaccaat 2040 tctctgcgag tgatcttctc ttctaagtga gcccactaattgctgcaatg gatgaaattg 2100 ggtgtttaat gctggagagc acatgtaggt gacacatgtgccttgaggta tgtgaggaca 2160 tgtaaattag atccacagtg agctgaggag ggctttccccgccagagtga ggttgggaag 2220 cagagttaat ccacttatag gatgaactgc ttggtatttttattgtattg tgactgtatt 2280 acaaagatgg acaattcact ccttgggagc aagttatgctctagaagttt atttacaaat 2340 atgctgggca gctctcttga aatattttcc caaggaagctattctacaca gtggcaaaat 2400 tgctatctaa ttaataatgt agctaaacta tgatatttatagtagcaaaa aactaaattc 2460 tataagattg cattaaagga aagatatatt ctatttgctcacttgggctg cttggtactc 2520 acctgccctc caggtgtact ttaggcctgt ggagggtgggcatttagtgg tgacccttgc 2580 accagggttt tctaacagat gaccctgtga atcataatttaaacctgcat atattttata 2640 gccagtcaca tttgccctct caccctatat ggccataaactgcctaagca ctcaggcctc 2700 ccactcatca acccctttga ccagagaaag aagcactctggttctctatc cccttgtcac 2760 atagagagtt tgtcatgggg cctctggctg tgcccttcacataacagaat aacttgccat 2820 ctgcctgcac caaacccagg gatgtggaag acatctccccacaactgcca ctgctcacca 2880 ggacaagctg cccttcctgt ctccacctct cagtccccctagaatggatg gctggggaga 2940 ggtggaggct gacagctgag acgtagtgtc agatatgatctaggagggcg gatcaccggg 3000 atccgggacc atacaagtaa catggtttcc atggcaactgcttgctcgtt tgaattaaga 3060 cagcagtcag ttgtcattgc catgacaagg cctctatctccaggcacaat gtccctgctg 3120 tctcctaatc caatggactt gctctcaccc cagggatgaaacacccagaa actcacttct 3180 cagtcacttc cacagccgat gactcagaag agccaaacccagaatggggc ctctcttttc 3240 cccatcacag actcccctga caacctttcc tggcgtaactagaggagtcc cagtgcagga 3300 taggccctaa acgttttgtt aaataaacag gtgcatgaaaggagcctaag gccattgttg 3360 atatccactc tcttctttcc acttccttct catctttttctccatgtttt atgcttctct 3420 gattccctct tctgcctgca ccagaccagc cccagccctttattcctctc cattttcact 3480 ccttccagcc tctgtccctg aactgccact ggcaacccatgggacctcag gaccagagac 3540 tgcttgactc atctggggag ggtaagttca cgggggacaaaaaaatgatt cctaaagaag 3600 aggcttccta gaccagcaca ggctccagaa agacatcccctaggcctgga cttctgagca 3660 gctttagcca ggctccggac ggcagccaga ggaggcctttccccattgct cctttcccca 3720 ttgctcaatg gattccatgt ttctttttct tggggggagcagggagggag aaaggtagaa 3780 aaatggcagc cacctttcca agaaaaatat aaagggtccaagctgtatag tatttgtcag 3840 tatttttttc tgtaaaattc gaacacacac aaaagaaaaatttatttaaa taaaatactt 3900 tgaaaatgaa aagtcttgat gtagtcagat ggttactttcttaacattag gtattacccc 3960 cactcagaca tcactcagaa atgatcaatg cagggactctttctgtgaca caaatgtccc 4020 agccctccct ggtcaccgcc ttcgccatgg tagagtcgtaggtctgagga tgaggaatgt 4080 ggctgtctca cccttgcttg caaaacagat ggccttggagaccagactcc ctcaaaggtg 4140 ccagctacag gaaaaataca ctgatgttcc ttggcaacacttacagaact ttccatcaat 4200 gagggtccat caatggcttc ttaaaggaaa aggggggaaatagcaaaaac ctaaggaaga 4260 atggaccttt gagttaaatc cagtgtttgt tgggaaaggagggatcaaaa acctctatag 4320 tagccactag ggcaaaaact gtgtgtatgt gtgtgtgtatgtgtgtgtac actgttcaat 4380 atggttcaat atggtaccaa tagccacatg tgactatttaaattcattgc aatgaaataa 4440 aattaaaggt atactagctc 4460 11 3076 DNA Homosapiens 11 gaattcaaaa tgtcttcagt tgtaaatctt accattattt tacgtacctctaagaaataa 60 aagtgcttct aattaaaata tgatgtcatt aattatgaaa tacttcttgataacagaagt 120 tttaaaatag ccatcttaga atcagtgaaa tatggtaatg tattattttcctcctttgag 180 ttaggtcttg tgcttttttt tcctggccac taaatttcac aatttccaaaaagcaaaata 240 aacatattct gaatattttt gctgtgaaac acttgacagc agagctttccaccatgaaaa 300 gaagcttcat gagtcacaca ttacatcttt gggttgattg aatgccactgaaacattcta 360 gtagcctgga gaagttgacc tacctgtgga gatgcctgcc attaaatggcatcctgatgg 420 cttaatacac atcactcttc tgtgaagggt tttaattttc aacacagcttactctgtagc 480 atcatgttta cattgtatgt ataaagatta tacaaaggtg caattgtgtatttcttcctt 540 aaaatgtatc agtataggat ttagaatctc catgttgaaa ctctaaatgcatagaaataa 600 aaataataaa aaatttttca ttttggcttt tcagcctagt attaaaactgataaaagcaa 660 agccatgcac aaaactacct ccctagagaa aggctagtcc cttttcttccccattcattt 720 cattatgaac atagtagaaa acagcatatt cttatcaaat ttgatgaaaagcgccaacac 780 gtttgaactg aaatacgact tgtcatgtga actgtaccga atgtctacgtattccacttt 840 tcctgctggg gttcctgtct cagaaaggag tcttgctcgt gctggtttctattacactgg 900 tgtgaatgac aaggtcaaat gcttctgttg tggcctgatg ctggataactggaaaagagg 960 agacagtcct actgaaaagc ataaaaagtt gtatcctagc tgcagattcgttcagagtct 1020 aaattccgtt aacaacttgg aagctacctc tcagcctact tttccttcttcagtaacaaa 1080 ttccacacac tcattacttc cgggtacaga aaacagtgga tatttccgtggctcttattc 1140 aaactctcca tcaaatcctg taaactccag agcaaatcaa gatttttctgccttgatgag 1200 aagttcctac cactgtgcaa tgaataacga aaatgccaga ttacttacttttcagacatg 1260 gccattgact tttctgtcgc caacagatct ggcaaaagca ggcttttactacataggacc 1320 tggagacaga gtggcttgct ttgcctgtgg tggaaaattg agcaattgggaaccgaagga 1380 taatgctatg tcagaacacc tgagacattt tcccaaatgc ccatttatagaaaatcagct 1440 tcaagacact tcaagataca cagtttctaa tctgagcatg cagacacatgcagcccgctt 1500 taaaacattc tttaactggc cctctagtgt tctagttaat cctgagcagcttgcaagtgc 1560 gggtttttat tatgtgggta acagtgatga tgtcaaatgc ttttgctgtgatggtggact 1620 caggtgttgg gaatctggag atgatccatg ggttcaacat gccaagtggtttccaaggtg 1680 tgagtacttg ataagaatta aaggacagga gttcatccgt caagttcaagccagttaccc 1740 tcatctactt gaacagctgc tatccacatc agacagccca ggagatgaaaatgcagagtc 1800 atcaattatc cattttgaac ctggagaaga ccattcagaa gatgcaatcatgatgaatac 1860 tcctgtgatt aatgctgccg tggaaatggg ctttagtaga agcctggtaaaacagacagt 1920 tcaaagaaaa atcctagcaa ctggagagaa ttatagacta gtcaatgatcttgtgttaga 1980 cttactcaat gcagaagatg aaataaggga agaggagaga gaaagagcaactgaggaaaa 2040 agaatcaaat gatttattat taatccggaa gaatagaatg gcactttttcaacatttgac 2100 ttgtgtaatt ccaatcctgg atagtctact aactgccgga attattaatgaacaagaaca 2160 tgatgttatt aaacagaaga cacagacgtc tttacaagca agagaactgattgatacgat 2220 tttagtaaaa ggaaatattg cagccactgt attcagaaac tctctgcaagaagctgaagc 2280 tgtgttatat gagcatttat ttgtgcaaca ggacataaaa tatattcccacagaagatgt 2340 ttcagatcta ccagtggaag aacaattgcg gagactacaa gaagaaagaacatgtaaagt 2400 gtgtatggac aaagaagtgt ccatagtgtt tattccttgt ggtcatctagtagtatgcaa 2460 agattgtgct ccttctttaa gaaagtgtcc tatttgtagg agtacaatcaagggtacagt 2520 tcgtacattt ctttcatgaa gaagaaccaa aacatcatct aaactttagaattaatttat 2580 taaatgtatt ataactttaa cttttatcct aatttggttt ccttaaaatttttatttatt 2640 tacaactcaa aaaacattgt tttgtgtaac atatttatat atgtatctaaaccatatgaa 2700 catatatttt ttagaaacta agagaatgat aggcttttgt tcttatgaacgaaaaagagg 2760 tagcactaca aacacaatat tcaatcaaaa tttcagcatt attgaaattgtaagtgaagt 2820 aaaacttaag atatttgagt taacctttaa gaattttaaa tattttggcattgtactaat 2880 acctggtttt ttttttgttt tgtttttttg tacagacagg gcagcatactgagaccctgc 2940 ctttaaaaac aaacagaaca aaaacaaaac accagggaca catttctctgtcttttttga 3000 tcagtgtcct atacatcgaa ggtgtgcata tatgttgaat gacattttagggacatggtg 3060 tttttataaa gaattc 3076 12 3056 DNA Homo sapiens 12cccagctggt gctgaagctc gtcagttcac catccgccct cggcttccgc ggggcgctgg 60gccgccagcc tcggcaccgt cctttccttt ctccctcgcg ttaggcaggt gacagcaggg 120acatgtctcg ggagatgcag gatgtagacc tcgctgaggt gaagcctttg gtggagaaag 180gggagaccat caccggcctc ctgcaagagt ttgatgtcca ggagcaggac atcgagactt 240tacatggctc tgttcacgtc acgctgtgtg ggactcccaa gggaaaccgg cctgtcatcc 300tcacctacca tgacatcggc atgaaccaca aaacctgcta caaccccctc ttcaactacg 360aggacatgca ggagatcacc cagcactttg ccgtctgcca cgtggacgcc cctggccagc 420aggacggcgc agcctccttc cccgcagggt acatgtaccc ctccatggat cagctggctg 480aaatgcttcc tggagtcctt caacagtttg ggctgaaaag cattattggc atgggaacag 540gagcaggcgc ctacatccta actcgatttg ctctaaacaa ccctgagatg gtggagggcc 600ttgtccttat caacgtgaac ccttgtgcgg aaggctggat ggactgggcc gcctccaaga 660tctcaggatg gacccaagct ctgccggaca tggtggtgtc ccaccttttt gggaaggaag 720aaatgcagag taacgtggaa gtggtccaca cctaccgcca gcacattgtg aatgacatga 780accccggcaa cctgcacctg ttcatcaatg cctacaacag ccggcgcgac ctggagattg 840agcgaccaat gccgggaacc cacacagtca ccctgcagtg ccctgctctg ttggtggttg 900gggacagctc gcctgcagtg gatgccgtgg tggagtgcaa ctcaaaattg gacccaacaa 960agaccactct cctcaagatg gcggactgtg gcggcctccc gcagatctcc cagccggcca 1020agctcgctga ggccttcaag tacttcgtgc agggcatggg atacatgccc tcggctagca 1080tgacccgcct gatgcggtcc cgcacagcct ctggttccag cgtcacttct ctggatggca 1140cccgcagccg ctcccacacc agcgagggca cccgaagccg ctcccacacc agcgagggca 1200cccgcagccg ctcgcacacc agcgaggggg cccacctgga catcaccccc aactcgggtg 1260ctgctgggaa cagcgccggg cccaagtcca tggaggtctc ctgctaggcg gcctgcccag 1320ctgccgcccc cggactctga tctctgtagt ggccccctcc tccccggccc cttttcgccc 1380cctgcctgcc atactgcgcc taactcggta ttaatccaaa gcttattttg taagagtgag 1440ctctggtgga gacaaatgag gtctattacg tgggtgccct ctccaaaggc ggggtggcgg 1500tggaccaaag gaaggaagca agcatctccg catcgcatcc tcttccatta accagtggcc 1560ggttgccact ctcctcccct ccctcagaga caccaaactg ccaaaaacaa gacgcgtagc 1620agcacacact tcacaaagcc aagcctaggc cgccctgagc atcctggttc aaacgggtgc 1680ctggtcagaa ggccagccgc ccacttcccg tttcctcttt aactgaggag aagctgatcc 1740agtttccgga aacaaaatcc ttttctcatt tggggagggg ggtaatagtg acatgcaggc 1800acctctttta aacaggcaaa acaggaaggg ggaaaaggtg ggattcatgt cgaggctaga 1860ggcatttgga acaacaaatc tacgtagtta acttgaagaa accgattttt aaagttggtg 1920catctagaaa gctttgaatg cagaagcaaa caagcttgat ttttctagca tcctcttaat 1980gtgcagcaaa agcaggcaac aaaatctcct ggctttacag acaaaaatat ttcagcaaac 2040gttgggcatc atggtttttg aaggctttag ttctgctttc tgcctctcct ccacagcccc 2100aacctcccac ccctgataca tgagccagtg attattcttg ttcagggaga agatcattta 2160gatttgtttt gcattcctta gaatggaggg caacattcca cagctgccct ggctgtgatg 2220agtgtccttg caggggccgg agtaggagca ctggggtggg ggcggaattg gggttactcg 2280atgtaaggga ttccttgttg ttgtgttgag atccagtgca gttgtgattt ctgtggatcc 2340cagcttggtt ccaggaattt tgtgtgattg gcttaaatcc agttttcaat cttcgacagc 2400tgggctggaa cgtgaactca gtagctgaac ctgtctgacc cggtcacgtt cttggatcct 2460cagaactctt tgctcttgtc ggggtggggg tgggaactca cgtggggagc ggtggctgag 2520aaaatgtaag gattctggaa tacatattcc atgggacttt ccttccctct cctgcttcct 2580cttttcctgc tccctaacct ttcgccgaat ggggcagcac cactgacgtt tctgggcggc 2640cagtgcggct gccaggttcc tgtactactg ccttgtactt ttcattttgg ctcaccgtgg 2700attttctcat aggaagtttg gtcagagtga attgaatatt gtaagtcagc cactgggacc 2760cgaggatttc tgggaccccg cagttgggag gaggaagtag tccagccttc caggtggcgt 2820gagaggcaat gactcgttac ctgccgccca tcaccttgga ggccttccct ggccttgagt 2880agaaaagtcg gggatcgggg caagagaggc tgagtacgga tgggaaacta ttgtgcacaa 2940gtctttccag aggagtttct taatgagata tttgtattta tttccagacc aataaatttg 3000taactttgca gcggaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa 3056 131930 DNA Homo sapiens 13 ggagagagag aggacagaga gcaagtcact cccggctgcctttttcacct ctgacagagc 60 ccagacacca tgaacgcaag tgaattccga aggagagggaaggagatggt ggattacgtg 120 gccaactaca tggaaggcat tgagggacgc caggtctaccctgacgtgga gcccgggtac 180 ctgcggccgc tgatccctgc cgctgcccct caggagccagacacgtttga ggacatcatc 240 aacgacgttg agaagataat catgcctggg gtgacgcactggcacagccc ctacttcttc 300 gcctacttcc ccactgccag ctcgtacccg gccatgcttgcggacatgct gtgcggggcc 360 attggctgca tcggcttctc ctgggcggca agcccagcatgcacagagct ggagactgtg 420 atgatggact ggctcgggaa gatgctggaa ctaccaaaggcatttttgaa tgagaaagct 480 ggagaagggg gaggagtgat ccagggaagt gccagtgaagccaccctggt ggccctgctg 540 gccgctcgga ccaaagtgat ccatcggctg caggcagcgtccccagagct cacacaggcc 600 gctatcatgg agaagctggt ggcttactca tccgatcaggcacactcctc agtggaaaga 660 gctgggttaa ttggtggagt gaaattaaaa gccatcccctcagatggcaa cttcgccatg 720 cgtgcgtctg ccctgcagga agccctggag agagacaaagcggctggcct gattcctttc 780 tttatggttg ccaccctggg gaccacaaca tgctgctcctttgacaatct cttagaagtc 840 ggtcctatct gcaacaagga agacatatgg ctgcacgttgatgcagccta cgcaggcagt 900 gcattcatct gccctgagtt ccggcacctt ctgaatggagtggagtttgc agattcattc 960 aactttaatc cccacaaatg gctattggtg aattttgactgttctgccat gtgggtgaaa 1020 aagagaacag acttaacggg agcctttaga ctggaccccacttacctgaa gcacagccat 1080 caggattcag ggcttatcac tgactaccgg cattggcagataccactggg cagaagattt 1140 cgctctttga aaatgtggtt tgtatttagg atgtatggagtcaaaggact gcaggcttat 1200 atccgcaagc atgtccagct gtcccatgag tttgagtcactggtgcgcca ggatccccgc 1260 tttgaaatct gtgtggaagt cattctgggg cttgtctgctttcggctaaa gggttccaac 1320 aaagtgaatg aagctcttct gcaaagaata aacagtgccaaaaaaatcca cttggttcca 1380 tgtcacctca gggacaagtt tgtcctgcgc tttgccatctgttctcgcac ggtggaatct 1440 gcccatgtgc agcgggcctg ggaacacatc aaagagctggcggccgacgt gctgcgagca 1500 gagagggagt aggagtgaag ccagctgcag gaatcaaaaattgaagagag atatatctga 1560 aaactggaat aagaagcaaa taaatatcat cctgccttcatggaactcag ctgtctgtgg 1620 cttcccatgt ctttctccaa agccatccag agggttgtgattttgtctgc ttagtatctc 1680 atcaacaaag aaatattatt tgctaattaa aaagttaatcttcatggcca tagcttttat 1740 tcattagctg tgatttttgt tgattaaaac attatagattttcatgttct tgcagtcatc 1800 agaagtggta ggaaagcctc actgatatat tttccagggcaatcaatgtt cacgcaactt 1860 gaaattatat ctgtggtctt caaattgtct tttgtcatgtggctaaatgc ctaataaaca 1920 attcaagtga 1930 14 512 DNA Homo sapiens 14gccctttctg cctctgcggg gctctggtcg ccggccaagg aaaaacgagg ctggaccctg 60aacagcgcgg gctacctgct gggcccacat gccgttggca accacaggtc attcagcgac 120aagaatggcc tcaccagcaa gcgggagctg cggcccgaag atgacatgaa accaggaagc 180tttgacaggt ccatacctga aaacaatatc atgcgcacaa tcattgagtt tctgtctttc 240ttgcatctca aagaggccgg tgccctcgac cgcctcctgg atctccccgc cgcagcctcc 300tcagaagaca tcgagcggtc ctgagagcct cctgggcacg tttgtctgtg tgctgtaacc 360tgaagtcaaa ccttaagata atggataatc ttcggccaat ttatgcggag tcagccattc 420ctgttctctt tgccttgatg ttgtgttgtt atcatttaag attttttttt tttggtaatt 480attttgagtg gcaaaataaa gaatagcaat ta 512 15 1637 DNA Homo sapiens 15gaggcgaacc ggagcgcggg gccgcggtcg ccccgaccag agccgggaga ccgcagcacc 60cgcagccgcc cgcgagcgcg ccgaagacag cgcgcaggcg agagcgcgcg ggcgggggcg 120cgcaggccct gcccgcccct tccgtcccca cccccctccg ccctttcctc tccccacctt 180cctctcgcct cccgcgcccc cgcaccgggc gcccaccctg tcctcctcct gcgggagcgt 240tgtccgtgtt ggcggccgca gcgggccggg ccggtccggc gggccggggg atggcgctgc 300tggacctggc cttggaggga atggccgtct tcgggttcgt cctcttcttg gtgctgtggc 360tgatgcattt catggctatc atctacaccc gattacacct caacaagaag gcaactgaca 420aacagcctta tagcaagctc ccaggtgtct ctcttctgaa accactgaaa ggggtagatc 480ctaacttaat caacaacctg gaaacattct ttgaattgga ttatcccaaa tatgaagtgc 540tcctttgtgt acaagatcat gatgatccag ccattgatgt atgtaagaag cttcttggaa 600aatatccaaa tgttgatgct agattgttta taggtggtaa aaaagttggc attaatccta 660aaattaataa tttaatgcca ggatatgaag ttgcaaagta tgatcttata tggatttgtg 720atagtggaat aagagtaatt ccagatacgc ttactgacat ggtgaatcaa atgacagaaa 780aagtaggctt ggttcacggg ctgccttacg tagcagacag acagggcttt gctgccacct 840tagagcaggt atattttgga acttcacatc caagatacta tatctctgcc aatgtaactg 900gtttcaaatg tgtgacagga atgtcttgtt taatgagaaa agatgtgttg gatcaagcag 960gaggacttat agcttttgct cagtacattg ccgaagatta ctttatggcc aaagcgatag 1020ctgaccgagg ttggaggttt gcaatgtcca ctcaagttgc aatgcaaaac tctggctcat 1080attcaatttc tcagtttcaa tccagaatga tcaggtggac caaactacga attaacatgc 1140ttcctgctac aataatttgt gagccaattt cagaatgctt tgttgccagt ttaattattg 1200gatgggcagc ccaccatgtg ttcagatggg atattatggt atttttcatg tgtcattgcc 1260tggcatggtt tatatttgac tacattcaac tcaggggtgt ccagggtggc acactgtgtt 1320tttcaaaact tgattatgca gtcgcctggt tcatccgcga atccatgaca atatacattt 1380ttttgtctgc attatgggac ccaactataa gctggagaac tggtcgctac agattacgct 1440gtgggggtac agcagaggaa atcctagatg tataactaca gctttgtgac tgtatataaa 1500ggaaaaaaga gaagtattat aaattatgtt tatataaatg cttttaaaaa tctaccttct 1560gtagttttat cacatgtatg ttttggtatc tgttctttaa tttatttttg catggcactt 1620gcatctgtga aaaaaaa 1637 16 2172 DNA Homo sapiens 16 agatcatcaaatcaaattcc acagggattg gtgaccaacc agaaggctca gacatctgat 60 tgctgacctgtccagacatc atctggtctc cctgaacctg aaatcacacc atggatgatt 120 ttgagcgtcgcagagaactt agaaggcaaa agagggagga gatgcgactc gaagcagaaa 180 gaatcgcctaccagaggaat gacgatgatg aagaggaggc agcccgggaa cgccgccgcc 240 gagcccgacaggaacggctg cggcagaagc aggaggaaga atccttggga caggtgaccg 300 accaggtggaggtgaatgcc cagaacagtg tgcctgacga ggaggccaag acaaccacca 360 caaacactcaagtggaaggg gatgatgagg ccgcattcct ggagcgcctg gctcggcgtg 420 aggaaagacgccaaaaacgc cttcaggagg ctctggagcg gcagaaggag ttcgacccaa 480 caataacagatgcaagtctg tcgctcccaa gcagaagaat gcaaaatgac acagcagaaa 540 atgaaactaccgagaaggaa gaaaaaagtg aaagtcgcca agaaagatac gagatagagg 600 aaacagaaacagtcaccaag tcctaccaga agaatgattg gagggatgct gaagaaaaca 660 agaaagaagacaaggaaaag gaggaggagg aagaggagaa gccaaagcga gggagcattg 720 gagaaaatcagatcaaagat gaaaagatta aaaaggacaa agaacccaaa gaagaagtta 780 agagcttcatggatcgaaag aagggattta cagaagttaa gtcgcagaat ggagaattca 840 tgacccacaaacttaaacat actgagaata ctttcagccg ccctggaggg agggccagcg 900 tggacaccaaggaggctgag ggcgcccccc aggtggaagc cggcaaaagg ctggaggagc 960 ttcgtcgtcgtcgcggggag accgagagcg aagagttcga gaagctcaaa cagaagcagc 1020 aggaggcggctttggagctg gaggaactca agaaaaagag ggaggagaga aggaaggtcc 1080 tggaggaggaagagcagagg aggaagcagg aggaagccga tcgaaaactc agagaggagg 1140 aagagaagaggaggctaaag gaagagattg aaaggcgaag agcagaagct gctgagaaac 1200 gccagaagatgccagaagat ggcttgtcag atgacaagaa accattcaag tgtttcactc 1260 ctaaaggttcatctctcaag atagaagagc gagcagaatt tttgaataag tctgtgcaga 1320 aaagcagtggtgtcaaatcg acccatcaag cagcaatagt ctccaagatt gacagcagac 1380 tggagcagtataccagtgca attgagggaa caaaaagcgc aaaacctaca aagccggcag 1440 cctcggatcttcctgttcct gctgaaggtg tacgcaacat caagagtatg tgggagaaag 1500 ggaatgtgttttcatccccc actgcagcag gcacaccaaa taaggaaact gctggcttga 1560 aggtaggggtttctagccgc atcaatgaat ggctaactaa aaccccagat ggaaacaagt 1620 cacctgctcccaaaccttct gacttgagac caggagacgt atccagcaag cggaacctct 1680 gggaaaagcaatctgtggat aaggtcactt cccccactaa ggtttgagac agttccagaa 1740 agaacccaagctcaagacgc aggacgagct cagttgtaga gggctaattc gctctgtttt 1800 gtatttatgttgatttacta aattgggttc attatctttt atttttcaat atcccagtaa 1860 acccatgtatattatcacta tatttaataa tcacagtcta gagatgttca tggtaaaagt 1920 actgcctttgcacaggatcc tgtttctaaa gaaacccatg ctgtgaaata gagacttttc 1980 tactgatcatcataactctg tatctgagca gtgataccaa ccacatctga agtcaacaga 2040 agatccaagtttaaaattgc tgcggaatgt gtgcagtatc tagaaaaatg aaccgtagtt 2100 tttgtttttttaaatacaga agtcatgttg tttctgcact ttataataaa gcatggaaga 2160 aattatcttagt 2172 17 5035 DNA Homo sapiens 17 gcggcggcgg cggcggcggc ggcagcggcggccaagcggc caggttggcg gccggggctc 60 cgggccgcgc gaggccacgg ccacgccgcgccgctgcgca caaccaacga ggcagagcgc 120 cgcccggcgc gagactgcgg ccgaagcgtggggcgcgcgt gcggaggacc aggcgcggcg 180 cggctgcggc tgagagtgga gcctttcaggctggcatgga gagcttaagg ggcaactgaa 240 ggagacacac tggccaagcg cggagttctgcttacttcag tcctgctgag atactctctc 300 agtccgctcg caccgaagga agctgccttgggatcagagc agacataaag ctagaaaaat 360 ttcaagacag aaacagtctc cgccagtcaagaaaccctca aaagtatttt gccatggata 420 tagaagatga agaaaacatg agttccagcagcactgatgt gaaggaaaac cgcaatctgg 480 acaacgtgtc ccccaaggat ggcagcacacctgggcctgg cgagggctct cagctctcca 540 atgggggtgg tggtggcccc ggcagaaagcggcccctgga ggagggcagc aatggccact 600 ccaagtaccg cctgaagaaa aggaggaaaacaccagggcc cgtcctcccc aagaacgccc 660 tgatgcagct gaatgagatc aagcctggtttgcagtacac actcctgtcc cagactgggc 720 ccgtgcacgc gcctttgttt gtcatgtctgtggaggtgaa tggccaggtt tttgagggct 780 ctggtcccac aaagaaaaag gcaaaactccatgctgctga gaaggccttg aggtctttcg 840 ttcagtttcc taatgcctct gaggcccacctggccatggg gaggaccctg tctgtcaaca 900 cggacttcac atctgaccag gccgacttccctgacacgct cttcaatggt tttgaaactc 960 ctgacaaggc ggagcctccc ttttacgtgggctccaatgg ggatgactcc ttcagttcca 1020 gcggggacct cagcttgtct gcttccccggtgcctgccag cctagcccag cctcctctcc 1080 ctgtcttacc accattccca cccccgagtgggaagaatcc cgtgatgatc ttgaacgaac 1140 tgcgcccagg actcaagtat gacttcctctccgagagcgg ggagagccat gccaagagct 1200 tcgtcatgtc tgtggtcgtg gatggtcagttctttgaagg ctcggggaga aacaagaagc 1260 ttgccaaggc ccgggctgcg cagtctgccctggccgccat ttttaacttg cacttggatc 1320 agacgccatc tcgccagcct attcccagtgagggtcttca gctgcattta ccgcaggttt 1380 tagctgacgc tgtctcacgc ctggtcctgggtaagtttgg tgacctgacc gacaacttct 1440 cctcccctca cgctcgcaga aaagtgctggctggagtcgt catgacaaca ggcacagatg 1500 ttaaagatgc caaggtgata agtgtttctacaggaacaaa atgtattaat ggtgaataca 1560 tgagtgatcg tggccttgca ttaaatgactgccatgcaga aataatatct cggagatcct 1620 tgctcagatt tctttataca caacttgagctttacttaaa taacaaagat gatcaaaaaa 1680 gatccatctt tcagaaatca gagcgaggggggtttaggct gaaggagaat gtccagtttc 1740 atctgtacat cagcacctct ccctgtggagatgccagaat cttctcacca catgagccaa 1800 tcctggaagg gtctcgctct tacacccaggctggagtgca gtggtgcaat catggctcac 1860 tgcagcctcg acctcctggg ctcttaagcgatccttccac ctcaaccttc caaggagctg 1920 ggactacaga accagcagat agacacccaaatcgtaaagc aagaggacag ctacggacca 1980 aaatagagtc tggtgagggg acgattccagtgcgctccaa tgcgagcatc caaacgtggg 2040 acggggtgct gcaaggggag cggctgctcaccatgtcctg cagtgacaag attgcacgct 2100 ggaacgtggt gggcatccag ggatccctgctcagcatttt cgtggagccc atttacttct 2160 cgagcatcat cctgggcagc ctttaccacggggaccacct ttccagggcc atgtaccagc 2220 ggatctccaa catagaggac ctgccacctctctacaccct caacaagcct ttgctcagtg 2280 gcatcagcaa tgcagaagca cggcagccagggaaggcccc caacttcagt gtcaactgga 2340 cggtaggcga ctccgctatt gaggtcatcaacgccacgac tgggaaggat gagctgggcc 2400 gcgcgtcccg cctgtgtaag cacgcgttgtactgtcgctg gatgcgtgtg cacggcaagg 2460 ttccctccca cttactacgc tccaagattaccaagcccaa cgtgtaccat gagtccaagc 2520 tggcggcaaa ggagtaccag gccgccaaggcgcgtctgtt cacagccttc atcaaggcgg 2580 ggctgggggc ctgggtggag aagcccaccgagcaggacca gttctcactc acgccctgac 2640 ccgggcagac atgatggggg gtgcagggggctgtgggcat ccagcgtcat cctccagaac 2700 ctcacatctg aactgggggc aggtgcataccttggggagg gagtaggggg acacggggga 2760 ccaccaggtg tccacggttg tccccagcatctcacatcag acctggggca ggtgcgcagt 2820 gtggggaggg gatggggtgc gtcagggcccagcatcgccg cctggcatct ctctgccgca 2880 gcatttcccc ttctgaaccg tccagtgactgctttcaatc tcggtttacg tttagaaatt 2940 gagttctact gagtagggct tccttaagtttaggaaaata gaaattactt tgtgtgaaat 3000 tcttgaataa ataatttatt cagagctaggaatgtggttt ataaaatagg aagtaattgt 3060 gtcaggtcac ttttatgcca cattattttaattgcaaaaa agcatctata tatggaggag 3120 ggtgggaaaa tagaggtagg aaatagtagcctaaaggaaa tcgccacacg tctgtctaaa 3180 cttaggtctc ttttctccgt aggtacctccctgggtagtt ccacacacta ggttgtaaca 3240 gtctctccct gaggagcaga ctcccagcatggtgtagcgt ggccctgtca tgcacatggg 3300 gtcccgcagc agtgactgtg tgtcctgcagaggcgtgacc caggcccctg tagccctcag 3360 cctcctctag aagcttctgt actccttgtaggatcagatc atggaaaact tttctcagtt 3420 tacttctaag taatcacaga taatacatggccagtaatcc caggctggcc attcattcag 3480 gttttttaaa ggatatttaa cttttatggactagaaggaa tcacgagggc tactgcacaa 3540 tacatggcct aagttccctc tgttccttcctctgaatcga atggatgtgg gtgaccgccc 3600 gaaggccttc acaggatgga agtagaatgatttcagtaga tactcattct tggaaaatgc 3660 catagtttta aattattgtt tccagctttatcaaagacat gtttgaaaaa taaaaagcat 3720 ccaagtgaga gctggtgaga ccacgtgctgctggcgtagt gtaggccaga cattgacagt 3780 cctgacggga gctcagggct gcccagcgcccagcgtgcac gggacggccc cacgacagag 3840 ggagtcagcc cgggaggtca ggagcgcggcgggcgagggc cctgtgtgga ccacctccac 3900 caagctcaga gatttgcaac caggtgccttgttgcctccg ctcaggatga aagaggagct 3960 gagagaagtg ctctgcctgc cagtgcagtgcccagctcca aggctctaga gggtgttcag 4020 gtacactgag gaggggacgg ctccgtcttcacattgtgca cagatctgag gatgggatta 4080 gcgaagctgt ggagactgca catccggacctgcccatgtc tcaaaacaaa cacatgtaca 4140 gtggctcttt ttccttctca aacactttaccccagaagca ggtggtctgc cccaggcata 4200 aagaaggaaa attggccatc tttcccacctctaaattctg taaaattata gacttgctca 4260 aaagattcct ttttatcatc cccacgctgtgtaagtggaa agggcattgt gttccgtgtg 4320 tgtccagttt acagcgtctc tgccccctagcgtgttttgt gacaatctcc cctgggtgag 4380 gagtgggtgc acccagcccc gaggccagtggttgctcggg gccttccgtg tgagttctag 4440 tgttcacttg atgccgggga atagaattagagaaaactct gacctgccgg gttccaggga 4500 ctggtggagg tggatggcag gtccgactcgaccatgactt agttgtaagg gtgtgtcggc 4560 tttttcagtc tcatgtgaaa atcctcctgtctctggcagc actgtctgca ctttcttgtt 4620 tactgtttga agggacgagt accaagccacaaggaacact tcttttggcc acagcataag 4680 ctgatggtat gtaaggaacc gatgggccattaaacatgaa ctgaacggtt aaaagcacag 4740 tctatggaac gctaatggag tcagcccctaaagctgtttg ctttttcagg ctttggatta 4800 catgctttta atttgatttt agaatctggacactttctat gaatgtaatt cggctgagaa 4860 acatgttgct gagatgcaat cctcagtgttctctgtatgt aaatctgtgt atacaccaca 4920 cgttacaact gcatgagctt cctctcgcacaagaccagct ggaactgagc atgagacgct 4980 gtcaaataca gacaaaggat ttgagatgttctcaataaaa agaaaatgtt tcact 5035 18 1700 DNA Homo sapiens 18 gccgaggctgcctgactgga atgagggtag ctgcggcgac tgcggcggct ggagcggggc 60 cggccatggcggtgtggacg cgggccacca aagcggggct ggtggagctg ctcctgaggg 120 agcgctgggtccgagtggtg gccgagctga gcggggagag cctgagcctg acgggcgacg 180 ccgccgcggccgagctggag cccgctctgg gacccgcggc cgccgccttc aacggcctcc 240 caaacggcggcggcgcgggc gactcgctgc ccgggagccc aagccgcggc ctggggcccc 300 cgagcccgccggcgccgcct cggggccccg cgggtgaggc gggcgcgtcg ccgcccgtgc 360 gccgggtgcgggtggtgaag caagaggcgg gcggcctggg catcagcatc aagggcggcc 420 gcgagaaccggatgccgatc ctcatctcca agatcttccc cgggctggct gccgaccaga 480 gccgggcgctgcggctgggc gacgccatcc tgtcggtgaa cggcaccgac ctgcgccagg 540 ccacccacgaccaggccgtg caggcgctga agcgcgcggg caaggaggtg ctgctggagg 600 tcaagttcatccgagaagta acaccatata tcaagaagcc atcattagta tcagatctgc 660 cgtgggaaggtgcagccccc cagtcaccaa gctttagtgg cagtgaggac tctggttcgc 720 caaaacaccagaacagcacc aaggacagga agatcatccc tctcaaaatg tgctttgctg 780 ctagaaacctaagcatgccg gatctggaaa acagattgat agagctacat tctcctgata 840 gcaggaacacgttgatccta cgctgcaaag atacagccac agcacactcc tggttcgtag 900 ctatccacaccaacataatg gctctcctcc cacaggtgtt ggctgaactc aacgccatgc 960 ttggggcaaccagtacagca ggaggcagta aagaggtgaa gcatattgcc tggctggcag 1020 aacaggcaaaactagatggt ggaagacagc aatggagacc tgtcctcatg gctgtgactg 1080 agaaggatttgctgctctat gactgtatgc cgtggacaag agatgcctgg gcgtcaccat 1140 gccacagctacccacttgtt gccaccaggt tggttcattc tggctccgga tgtcgatccc 1200 cctcccttggatctgacctt acatttgcta ccaggacagg ctctcgacag ggcattgaga 1260 tgcatctcttcagggtggag acacatcggg atctgtcatc ctggaccagg atacttgttc 1320 agggttgccatgctgctgct gagctgatca aggaagtctc tctaggctgc atgttaaatg 1380 gccaagaggtgaggcttact attcactatg aaaatgggtt caccatctca agggaaaatg 1440 gaggctccagcagcatattg taccgctacc cctttgaaag gctgaagatg tctgctgatg 1500 atggcatccgaaatctatac ttggattttg gtggtcccga gggagaactg accatggacc 1560 tgcactcttgtccgaagccg attgtatttg tgttgcacac gtttttatcg gccaaagtca 1620 ctcgtatgggactgcttgta tgagcaacaa aaaatcagaa aagagccttg actgtcacaa 1680 gaaatatttccacctccaaa 1700 19 3086 DNA Homo sapiens 19 actgccacct cggtcggtcggtgcttactt cgctgccagc tggtctgtcg ccatgaaccc 60 ggacctgcgc agggagcgggattccgccag cttcaacccg gagctgctta cacacatcct 120 ggacggcagc cccgagaaaacgcggcgccg ccgagagatc gagaacatga tcctgaacga 180 cccagacttc cagcatgaggacttgaactt cctaactcgc agccagcgtt atgaggtggc 240 tgtcaggaaa agtgccatcatggtgaagaa gatgagggag tttggcatcg ctgaccctga 300 tgaaattatg tggtttaaaaaactacattt ggtcaatttt gtggaacctg tgggcctcaa 360 ttactccatg tttattcctaccttgctgaa tcagggcacc actgctgaga aagagaaatg 420 gctgctttca tccaaaggactccagataat tggcacctac gcccagacgg aaatgggcca 480 cggaactcac cttcgaggcttggaaaccac agccacgtat gaccctgaaa cccaggagtt 540 cattctcaac agtcctactgtgacctccat taaatggtgg cctggtgggc ttggaaaaac 600 ttcaaatcat gcaatagtccttgcccagct catcactaag gggaaatgct atggattaca 660 tgcctttatc gtacctatccgtgaaatcgg gacccataag cctttgccag gaattaccgt 720 tggtgacatc gggcccaaatttggttatga tgagatagac aatggctacc tcaaaatgga 780 caaccatcgt attcccagagaaaacatgct gatgaagtat gcccaggtga agcctgatgg 840 cacatacgtg aaaccgctgagtaacaagct gacttacggg accatggtgt ttgtcaggtc 900 cttccttgtg ggagaagctgctcgggctct gtctaaggcg tgcaccattg ccatccgata 960 cagcgctgtg aggcaccagtctgaaatgaa gccaggtgaa ccagaaccac agattttgga 1020 ttttcaaacc cagcagtataaactctttcc actcctggcc actgcctatg ccttccagtt 1080 tgtgggcgca tacatgaaggagacctatca ccggattaac gaaggcattg gtcaagggga 1140 cctgagtgaa ctgcctgagcttcatgccct caccgctgga ctgaaggctt tcacctcctg 1200 gactgcaaac actggcattgaagcatgtcg gatggcttgt ggtgggcatg gctattctca 1260 ttgcagtggt cttccaaatatttatgtcaa tttcacccca agctgtacct ttgagggaga 1320 aaacactgtc atgatgctccagacggctag gttcctgatg aaaagttatg atcaggtgca 1380 ctcaggaaag ttggtgtgtggcatggtgtc ctatttgaac gacctgccca gtcagcgcat 1440 ccagccacag caggtagcagtctggccaac catggtggat atcaacagcc ccgaaagcct 1500 aaccgaagca tataaactccgtgcagccag attagtagaa attgctgcaa aaaaccttca 1560 aaaagaagtg attcacagaaaaagcaagga ggtagcttgg aacctaactt ctgttgacct 1620 tgttcgagca agtgaggcacattgccacta tgtggtagtt aagctctttt cagaaaaact 1680 cctcaaaatt caagataaagccattcaagc tgtcttaagg agtttatgtc tgctgtattc 1740 tctgtatgga atcagtcagaacgcggggga tttccttcag gggagcatca tgacagagcc 1800 tcagattaca caagtaaaccagcgtgtaaa ggagttactc actctgattc gctcagatgc 1860 tgttgctttg gttgatgcatttgattttca ggatgtgaca cttggctctg tgcttggccg 1920 ctatgatggg aatgtgtatgaaaacttgtt tgagtgggct aagaactccc cactgaacaa 1980 agcagaggtc cacgaatcttaccacaagca cctgaagtca ctgcagtcca agctctgaag 2040 tgtcacaagg acaagtttaatctgcttcag aaagcgcctg tgtgcaactc aaattttgtg 2100 gaatcttttc gaattcaaatagctatagag caaatgataa attgacccct ttttataaat 2160 ggagggaaaa aatgaacagatttcagagat taaatgaaaa aaagcagatg tgttttaagt 2220 gcaattaaca ctgaaagagacctgttaaac cattcagaaa aagcttaaga aatgcgatat 2280 gacttccttt tgtaatgctgctgatcccag tagactatga cttttgataa ttagcagaat 2340 ttaactactg agtagttgattattttcaca ttttaattgc taatcactgg ctatataagt 2400 gtttttaagc aagggtatttttgaagtggt gtagaaccct tccacgcttt cctgctcagt 2460 gttctaccag acaagaaaagggacttgggg aaggaaactt attggaaact tgatgcgaat 2520 taggttcttc tttgcacaaactctgcctgc ttgctctccc ttgctgatgg gttgcaattc 2580 tcaaactatt catgctagcaatttttccac gggggggcct ttttcccacg ggggcctcta 2640 taggggccca tttctccggtaaataggaat ttccccttta aggggtgcca gtagtaggag 2700 tatagggaac ctctcagctgtggcactgtt gtagctttgg agtcagagtg tactctgggc 2760 aatcagattt ccacatattctgcatcttgg ataagcatta aaagttggga tactaatttg 2820 gataaaaaaa tgcactaggcaaactccagc gagacagaaa gtatagggaa acctctcagc 2880 tgtggcactg ttgtagctttggagtgcaga gtgtaactct ggcgacaatc agatttcaca 2940 tattctgtca tcttggcataagccattaaa agcttggaga ttactgtatt tggcattaaa 3000 aaaaaatgtc acttaggtcagcactcccag acgtagcaca gaaaaaccct ttgacacaaa 3060 ccatgtgttc tgatttttggttcaga 3086 20 1302 DNA Homo sapiens 20 gcttcgggtg ccatggggac tcctcccggcctgcagaccg actgcgaggc gctgctcagc 60 cgcttccagg agacggacag tgtacgcttcgaggacttca cggagctctg gagaaacatg 120 aagttcggga ctatcttctg tggcagaatgagaaatttag aaaagaacat gtttacaaaa 180 gaagctttag ctttggcttg gcgatattttttacctccat acaccttcca gatcagagtt 240 ggtgctttgt atctgctata tggattatataatacccaac tgtgtcaacc aaaacaaaag 300 atcagagttg ccctgaagga ttgggatgaagttttaaaat ttcagcaaga tttagtaaat 360 gcacagcatt ttgatgcagc ttatatttttaggaagctac gactagacag agcatttcac 420 tttacagcaa tgcccaaatt gctgtcatataggatgaaga aaaaaattca ccgagctgaa 480 gttacagaag aatttaagga cccaagtgatcgtgtgatga aacttatcac ttctgatgta 540 ttagaggaaa tgctgaatgt tcatgatcattatcagaaca tgaaacatgt aatttcagtt 600 gataagtcca agccagataa agccctcagcttgataaagg atgatttttt tgacaatatt 660 aagaacatag ttttggagca tcagcagtggcacaaagaca gaaagaatcc atccttaaag 720 tcaaaaacta atgatggaga agaaaaaatggaaggaaatt cacaagaaac ggagagatgt 780 gaaagggcag aatcattagc gaaaataaaatcaaaggcct tttcagttgt catacaggca 840 tccaaatcaa gaaggcatcg tcaagtcaaactcgactctt ctgactctga ttctgcatct 900 ggtcaagggc aagtcaaagc aactaggaaaaaagagaaga aagaaagatt gaaaccagca 960 ggaaggaaga tgtctctcag aaacaaaggcaatgtgcaga atatacacaa ggaagataaa 1020 cctttaagtc tgagtatgcc tgtaattacagaagaagaag agaatgaaag tttgagtgga 1080 acagagttca ctgcatccaa gaagaggagaaaacactgaa caaagagcct ggtgtagttt 1140 ttaattttga gttttctgac agaagaaaagattgatattt tgtgtattga acaggaagac 1200 tgccagtatt aaaaaaatcc ttctgggaatctgtaggtta tttcttggaa attgcaatac 1260 gtagttctag aataaaagta caaaaaattagaataagaat tc 1302 21 2081 DNA Homo sapiens 21 atggatggat ggcccgccaagagaaggagc agtgcactgt ggtcagagat gctggacatc 60 accatgaagg agtctctcaccaccagggag atcagacggc aggaggcaat atatgaaatg 120 tcccgaggtg aacaggatttaattgaggat ctcaaacttg caagaaaggc ctaccatgac 180 cccatgttaa agttgtccatcatgtcagaa gaggaactca cacatatatt tggtgatctg 240 gactcttaca tacctctgcatgaagatttg ttgacaagaa taggagaagc aaccaagcct 300 gatggaacag tggagcagattggtcacatt ctcgtgagct ggttaccgcg cttgaatgcc 360 tacagaggtt actgtagtaaccagctggca gccaaagctc ttcttgatca aaagaaacag 420 gatccaagag tccaagacttcctccagcga tgtctcgagt ctcccttcag tcgaaaacta 480 gatctttgga gtttcctagatatccctcga agtcgcctag tcaaataccc tttactgtta 540 aaagaaattc ttaaacacactccaaaagag caccctgatg ttcagcttct ggaggatgct 600 atattgataa tacagggagtcctctctgat atcaacttga agaaaggtga atccgagtgc 660 cagtattaca tcgacaagctggagtacctg gatgaaaagc agagggaccc cagaatcgaa 720 gcgagcaaag tgctgctgtgccatggggag ctgcggagca agagtggaca taaactttac 780 attttcctgt ttcaagacatcttggttctg actcggcccg tcacacggaa cgaacggcac 840 tcttaccagg tttaccggcagccaatccca gtccaagagc tagtcctaga agacctgcag 900 gatggagatg tgagaatgggaggctccttt cgaggagctt tcagtaactc agagaaagct 960 aaaaatatct ttagaattcgcttccatgac ccctctccag cccagtctca cactctgcaa 1020 gccaatgacg tgttccacaagcagcagtgg ttcaactgta ttcgagcggc cattgccccc 1080 ttccagtcgg caggcagtccacctgagctg cagggcctgc cggagctgca cgaagagtgt 1140 gaggggaacc acccctctgcgaggaaactc acagcccaga ggagggcatc cacagtttcc 1200 agtgttactc aggtagaagttgatgaaaac gcttacagat gtggctctgg catgcagatg 1260 gcagaggaca gcaagagcttaaagacacac cagacacagc ccggcatccg aagagcgagg 1320 gacaaagccc ttctggtggcaaacggaaag agactttggt gtagagaagg ctctgtgtgt 1380 taactgatgg gagagactgtttgtttataa atgtgtacag ttttgttttc tcgtaagggg 1440 agcatcatag ggttactttataccagttgt aacattttca ttgtttttgg ttgttctttt 1500 ttcttttttt aatggcagctaaagatatac agattactgt taaattgcag tccttttttt 1560 tttaaagata ttttcttgagttatttagaa catggtaagc ctggtatttt ttaatcaaac 1620 aaaatattta tgaaatgggttttctcttaa ttctggattc atcatggctt tctaatacca 1680 attgtaatat ttacaatattcaccaaaact tagaattttg caaatgcagg aattctgcca 1740 gtgtttcttt gctaagccttgcatgcaaaa tttgaaattt taacattggc acccaaaacc 1800 tacatggaat gtatgtctggagtatttcaa actttacatt gaaacataat ttccttggaa 1860 aacaaaccat aagcctgaggaggtttttat caactggaat gctttatatt agtttgtttt 1920 tcactgtaca ttcctcattttacattcatt taacctgccg attatttaat ttttttattg 1980 taaagtagtt tttagcatttgcttttattt ttttactttg atgccttaac aaattggcac 2040 gtctttaaag tatttttcttcctgattaaa aatgtgtgtg t 2081 22 968 DNA Homo sapiens 22 gaattccgaagccggcgacc ggtctgacgt cccgagcagg gcatggtcta gtggcccagt 60 caggacgcgaaacactccct ggaggttctg acccactccc tctcagcctc cgcctggtct 120 ctggtgtagtcgccgccgcc agccgccatg ggcaaacaga acagcaagct gcggcccgag 180 gtgctgcaggacctgcggga gaagacggag ttcaccgacc acgagctgca ggagtggtac 240 aagggcttcctcaaggactg ccccaccggc cacctgaccg tggacgagtt caagaagatc 300 tacgccaacttcttccccta cggcgacgct tccaagttcg ccgagcacgt cttccgcacc 360 ttcgacaccaacggcgacgg caccatcgac ttccgggagt tcatcattgg cctgagcgtg 420 actcgcgggggcaagctgga gcagaagctc aagtgggcct tcagcatgta cgacctggac 480 ggcaacggctacatcagccg cagcgagatg ctggagatcg tgcaggccat ctacaagatg 540 gtgtcgtctgtgatgaagat gccggaggat gagtccaccc cggagaagcg cacagacaag 600 atcttcaggcagatggacac caacaatgac ggcaaactgt ccttggaaga attcatcaga 660 ggtgccaagagcgacccctc catcgtccgc ctgctgcagt gcgaccccag cagtgccagt 720 cagttctgagcgagcggccc ctggacagtt gcagagaaac acaggcttgt cgtgccgttt 780 aagctttgcttgcaagagtg gatgccccgc aatcgttcct gctctcccgg gcccccgctg 840 ggcatgtccgtttgcacctg cccgggcgcc ggtgcgcctc cctcctccac ctgaccaacg 900 cgacattcctcccctcacgc ctggcccggt cccttccagg aactccaggg atgtggtgac 960 atgcaggg 96823 1204 DNA Homo sapiens 23 ctctgaggag aagcagcagc aaacatttgc tagtcagacaagtgacaggg aatggattcc 60 aaacagcagt gtgtaaagct aaatgatggc cacttcatgcctgtattggg atttggcacc 120 tatgcacctc cagaggttcc gagaagtaaa gctttggaggtcacaaaatt agcaatagaa 180 gctgggttcc gccatataga ttctgctcat ttatacaataatgaggagca ggttggactg 240 gccatccgaa gcaagattgc agatggcagt gtgaagagagaagacatatt ctacacttca 300 aagctttggt ccacttttca tcgaccagag ttggtccgaccagccttgga aaactcactg 360 aaaaaagctc aattggacta tgttgacctc tatcttattcattctccaat gtctctaaag 420 ccaggtgagg aactttcacc aacagatgaa aatggaaaagtaatatttga catagtggat 480 ctctgtacca cctgggaggc catggagaag tgtaaggatgcaggattggc caagtccatt 540 ggggtgtcaa acttcaaccg caggcagctg gagatgatcctcaacaagcc aggactcaag 600 tacaagcctg tctgcaacca ggtagaatgt catccgtatttcaaccggag taaattgcta 660 gatttctgca agtcgaaaga tattgttctg gttgcctatagtgctctggg atctcaacga 720 gacaaacgat gggtggaccc gaactccccg gtgctcttggaggacccagt cctttgtgcc 780 ttggcaaaaa agcacaagcg aaccccagcc ctgattgccctgcgctacca gctgcagcgt 840 ggggttgtgg tcctggccaa gagctacaat gagcagcgcatcagacagaa cgtgcaggtt 900 tttgagttcc agttgactgc agaggacatg aaagccatagatggcctaga cagaaatctc 960 cactatttta acagtgatag ttttgctagc caccctaattatccatattc agatgaatat 1020 taacatggag agctttgcct gatgtctacc agaagccctgtgtgtggatg gtgacgcaga 1080 ggacgtctct atgccggtga ctggacatat cacctctacttaaatccgtc ctgtttagcg 1140 acttcagtca actacagctg agtccatagg ccagaaagacaataaatttt tatcattttg 1200 aaat 1204 24 1698 DNA Homo sapiens 24tcggcacagg agcgaggaga cccgagagca gacgcgccct ggcgcccgcc ctgcgcagtc 60accatggcga tgcatttcat cttctcagat acagcggtgc ttctgtttga tttctggagt 120gtccacagtc ctgctggcat ggccctttcg gtgttggtgc tcctgcttct ggctgtactg 180tatgaaggca tcaaggttgg caaagccaag ctgctcaacc aggtactggt gaacctgcca 240acctccatca gccagcagac catcgcagag acagacgggg actctgcagg ctcagattca 300ttccctgttg gcagaaccca ccacaggtgg tacttgtgtc actttggcca gtctctaatc 360catgtcatcc aggtggtcat cggctacttc atcatgctgg ccgtaatgtc ctacaacacc 420tggattttcc ttggtgtggt cttgggctct gctgtgggct actacctagc ttacccactt 480ctcagcacag cttagatggt gaggaacgtg caggcactga ggctggaggg acatggagcc 540ccctcttcca gacactatac ttccaactgc cctttcttct gatggctatt cctccacctt 600attcccagcc cctggaaact ttgagctgaa gccagcactt gctccctgga gttcggaagc 660cattgcagca accttccttc tcagccagcc tacgtagggc ccaggcatgg tcttgtgtct 720taagacagct gctgtgacca aagggagaat ggagataaca ggggtggcag ggttactgag 780cccatgacaa tgcttctctg tgactcaaac caggaatttc caaagatttc aagccaggga 840gaagggttct tggtgatgca gggcatggaa cctggacacc ctcagctctc ctgctttgtg 900ccttatctac aggagcatcg cccattggac ttcctgacct cttctgtctt tgagggacag 960agaccaagct agatcctttt tctcaccttt ctgcctttgg aacacatgaa gatcatctcg 1020tctatggatc atgttgacaa actaagtttt ttttattttt cccattgaac tcctagttgg 1080caattttgca cattcataca aaaaaatttt taatgaaatg atttcattga ttcatgatgg 1140atggcagaaa ctgctgagac ctatttccct ttcttgggga gagaataagt gacagctgat 1200taaaggcaga gacacaggac tgctttcagg ctcctggttt attctctgat tgactgagct 1260ccttccacca gaaggcactg cctgcaggaa gaagatgatc tgatggccgt gggtgtctgg 1320gaagctcttc gtggcctcaa tgccctcctt tatcctcatc tttcttctat gcagaacaaa 1380aagctgcatc taataatgtt caatacttaa tattctctat ttattactta ctgcttactc 1440gtaatgatct agtggggaaa catgattcat tcacttaaaa tactgattaa gccatgggca 1500ggtactgact gaagatgcaa tccaaccaaa gccattacat tttttgagtt agatgggact 1560ctctggatag ttgaacctct tcactttata aaaaaggaaa gagagaaaat cactgctgta 1620tactaaatac ctcacagatt agatgaaaag atggttgtaa gctttgggaa ttaaaaacaa 1680atacatttta gtaaatat 1698 25 3213 DNA Homo sapiens 25 aatcatcgctcgcagcggcg gcgcccgcag tggccgcagc agcgcgccgg gccctggccg 60 cgccccagccgagcgcagcg cggagtcgcc ccgacctttc tctgcgcagt acggccgccg 120 ggaccgcagcatggcgggca tcgcggccaa gctggcgaag gaccgggagg cggccgaggg 180 gctgggctcccacgagaggg ccatcaagta cctcaaccag gactacgagg cgctgcggaa 240 cgagtgcctggaggccggga cgctcttcca ggacccgtcc ttcccggcca tcccctcggc 300 cctgggcttcaaggagttgg ggccctactc cagcaaaacc cggggcatga gatggaagcg 360 ccccacggagatctgcgctg acccccagtt tatcattgga ggagccaccc gcacagacat 420 ctgccaaggagccctaggtg actgctggct gctggcagcc attgcctccc tcaccttgaa 480 tgaagaaatcctggctcgag tcgtccccct aaaccagagc ttccaggaaa actatgcagg 540 gatctttcacttccagttct ggcaatacgg cgagtgggtg gaggtggtgg tggatgacag 600 gctgcccaccaaggacgggg agctgctctt tgtgcattca gccgaaggga gcgagttctg 660 gagcgccctgctggagaagg catacgccaa gatcaacgga tgctatgaag ctctatcagg 720 gggtgccaccactgagggct tcgaagactt caccggaggc attgctgagt ggtatgagtt 780 gaagaagccccctcccaacc tgttcaagat catccagaaa gctctgcaaa aaggctctct 840 ccttggctgctccatcgaca tcaccagcgc cgcggactcg gaggccatca cgtttcagaa 900 gctggtgaaggggcacgcgt actcggtcac cggagccgag gaggttgaaa gtaacggaag 960 cctacagaaactgatccgca tccgaaatcc ctggggagaa gtggagtgga cagggcggtg 1020 gaatgacaactgcccaagct ggaacactat agacccagag gagagggaaa ggctgaccag 1080 acggcatgaagatggagaat tctggatgtc tttcagtgac ttcctgaggc actattcccg 1140 cctggagatctgtaacctga ccccagacac tctcaccagc gatacctaca agaagtggaa 1200 actcaccaaaatggatggga actggaggcg gggctccacc gcgggaggtt gcaggaacta 1260 cccgaacacattctggatga accctcagta cctgatcaag ctggaggagg aggatgagga 1320 cgaggaggatggggagagcg gctgcacctt cctggtgggg ctcattcaga agcaccgacg 1380 gcggcagaggaagatgggcg aggacatgca caccatcggc tttggcatct atgaggttcc 1440 agaggagttaagtgggcaga ccaacatcca cctcagcaaa aacttcttcc tgacgaatcg 1500 cgccagggagcgctcagaca ccttcatcaa cctccgggag gtgctcaacc gcttcaagct 1560 gccgccaggagagtacattc tcgtgccttc caccttcgaa cccaacaagg atggggattt 1620 ctgcatccgggtcttttctg aaaagaaagc tgactaccaa gctgtcgatg atgaaatcga 1680 ggccaatcttgaagagttcg acatcagcga ggatgacatt gatgatggag tcaggagact 1740 gtttgcccagttggcaggag aggatgcgga gatctctgcc tttgagctgc agaccatcct 1800 gagaagggttctagcaaagc gccaagatat caagtcagat ggcttcagca tcgagacatg 1860 caaaattatggttgacatgc tagattcgga cgggagtggc aagctggggc tgaaggagtt 1920 ctacattctctggacgaaga ttcaaaaata ccaaaaaatt taccgagaaa tcgacgttga 1980 caggtctggtaccatgaatt cctatgaaat gcggaaggca ttagaagaag caggtttcaa 2040 gatgccctgtcaactccacc aagtcatcgt tgctcggttt gcagatgacc agctcatcat 2100 cgattttgataattttgttc ggtgtttggt tcggctggaa acgctattca agatatttaa 2160 gcagctggatcccgagaata ctggaacaat agagctcgac cttatctctt ggctctgttt 2220 ctcagtactttgaagttata actaatctgc ctgaagactt ctcatgatgg aaaatcagcc 2280 aaggactaagcttccataga aatacacttt gtatctggac ctcaaaatta tgggaacatt 2340 tacttaaacggatgatcata gctgaaaata atgatactgt caatttgaga tagcagaagt 2400 ttcacacatcaaagtaaaag atttgcatat cattatacta aatgcaaatg agtcgcttaa 2460 cccttgacaaggtcaaagaa agctttaaat ctgtaaatag tatacacttt ttacttttac 2520 acactttcctgttcatagca atattaaatc aggaaaaaaa aatgcaggga ggtatttaac 2580 agctgagcaaaaacattgag tcgctctcaa aggacacgag gcccttggca gggaatattt 2640 aaagcaacttcaagtttaaa atgcagctgt tgattctacc aaacaacagt ccaagattac 2700 catttcccatgagccaactg ggaaacatgg tatatcatga agtaatcttg tcaaggcatc 2760 tggagagtccaggagaggag actcacctct gtcgcttggg ttaaacaaga gacaggtttt 2820 gtagaatattgattggtaat agtaaatcgt tctccttaca atcaagttct tgaccctatt 2880 cggccttatacatctggtct tacaaagacc aaagggatcc tgcgcttgat caactgaacc 2940 agtatgccaaaaccaggcat ccaatttgta aaccaattat gataaaggac aaaataagct 3000 gtttgccacctcaaaacttt atgaacttca ccaccactag tgtctgtcca tggagttaga 3060 ggggacatcacttagaagtt cttatagaaa ggacacaagt ttgtttcctg gctttacctt 3120 gggaaaatgctagcaacatt atagaaattt tgccttgttg ccttatcttc ttccaaatgt 3180 actgttaaataaaaataaag ggttacccca tcg 3213 26 5316 DNA Homo sapiens 26 atcatggcggatggccccag gtgtaagcgc agaaagcagg cgaacccgcg gcgcaataac 60 gttacaaattataatactgt ggtagaaaca aattcagatt cagatgatga agacaaactg 120 catattgtggaagaagaaag tgttacagat gcagctgact gtgaaggtgt accagaggat 180 gacctgccaacagaccagac agtgttacca gggaggagca gtgaaagaga agggaatgct 240 aagaactgctgggaggatga cagaaaggaa gggcaagaaa tcctggggcc tgaagctcag 300 gcagatgaagcaggatgtac agtaaaagat gatgaatgcg agtcagatgc agaaaatgag 360 caaaaccatgatcctaatgt tgaagagttt ctacaacaac aagacactgc tgtcattttt 420 cctgaggcacctgaagagga ccagaggcag ggcacaccag aagccagtgg tcatgatgaa 480 aatggaacaccagatgcatt ttcacaatta ctcacctgtc catattgtga tagaggctat 540 aaacgctttacctctctgaa agaacacatt aaatatcgtc atgaaaagaa tgaagataac 600 tttagttgctccctgtgcag ttacaccttt gcatacagaa cccaacttga acgtcacatg 660 acatcacataaatcaggaag agatcaaaga catgtgacgc agtctgggtg taatcgtaaa 720 ttcaaatgcactgagtgtgg aaaagctttc aaatacaaac atcacctaaa agagcactta 780 agaattcacagtggagagaa gccatatgaa tgcccaaact gcaagaaacg cttttcccat 840 tctggctcctatagctcaca cataagcagt aagaaatgta tcagcttgat acctgtgaat 900 gggcgaccaagaacaggact caagacatct cagtgttctt caccgtctct ttcagcatca 960 ccaggcagtcccacacgacc acagatacgg caaaagatag agaataaacc ccttcaagaa 1020 caactttctgttaaccaaat taaaactgaa cctgtggatt atgaattcaa acccatagtg 1080 gttgcttcaggaatcaactg ttcaacccct ttacaaaatg gggttttcac tggtggtggc 1140 ccattacaggcaaccagttc tcctcagggc atggtgcaag ctgttgttct gccaacagtt 1200 ggtttggtgtctcccataag tatcaattta agtgatattc agaatgtact taaagtggcg 1260 gtagatggtaatgtaataag gcaagtgttg gagaataatc aagccaatct tgcatccaaa 1320 gaacaagaaacaatcaatgc ttcacccata caacaaggtg gccattctgt tatttcagcc 1380 atcagtcttcctttggttga tcaagatgga acaaccaaaa ttatcatcaa ctacagtctt 1440 gagcagcctagccaacttca agttgttcct caaaatttaa aaaaagaaaa tccagtcgct 1500 acaaacagttgtaaaagtga aaagttacca gaagatctta ctgttaagtc tgagaaggac 1560 aaaagctttgaagggggggt gaatgatagc acttgtcttc tgtgtgatga ttgtccagga 1620 gatattaatgcacttccaga attaaagcac tatgacctaa agcagcctac tcagcctcct 1680 ccactccctgcagcagaagc tgagaagcct gagtcctctg tttcatcagc tactggagat 1740 ggcaatttgtctcctagtca gccaccttta aagaacctct tgtctctcct aaaagcatat 1800 tatgctttgaatgcacaacc aagtgcagaa gagctctcaa aaattgctga ttcagtaaac 1860 ctaccactggatgtagtaaa aaagtggttt gaaaagatgc aagctggaca gatttcagtg 1920 cagtcttctgaaccatcttc tcctgaacca ggcaaagtaa atatccctgc caagaacaat 1980 gatcagcctcaatctgcaaa tgcaaatgaa ccccaggaca gcacagtaaa tctacaaagt 2040 cctttgaagatgactaactc cccagtttta ccagtgggat caaccaccaa tggttccaga 2100 agtagtacaccatccccatc acctctaaac ctttcctcat ccagaaatac acagggttac 2160 ttgtacacagctgagggtgc acaagaagag ccacaagtag aacctcttga tctttcacta 2220 ccaaagcaacagggagaatt attagaaagg tcaactatca ctagtgttta ccagaacagt 2280 gtttattctgtccaggaaga acccttgaac ttgtcttgcg caaaaaagga gccacaaaag 2340 gacagttgtgttacagactc agaaccagtt gtaaatgtaa tcccaccaag tgccaacccc 2400 ataaatatcgctatacctac agtcactgcc cagttaccca caatcgtggc cattgctgac 2460 cagaacagtgttccatgctt aagagcgcta gctgccaata agcaaacgat tctgattccc 2520 caggtggcatacacctactc aactacggtc agccctgcag tccaagaacc acccttgaaa 2580 gtgatccagccaaatggaaa tcaggatgaa agacaagata ctagctcaga aggagtatca 2640 aatgtagaggatcagaatga ctctgattct acaccgccca aaaagaaaat gcggaagaca 2700 gaaaatggaatgtatgcttg tgatttgtgt gacaagatat tccaaaagag tagttcatta 2760 ttgagacataaatatgaaca cacaggtaaa agacctcatg agtgtggaat ctgtaaaaag 2820 gcatttaaacacaaacatca tttgattgaa cacatgcgat tacattctgg agaaaagccc 2880 tatcaatgtgacaaatgtgg aaagcgcttc tcacactctg ggtcttattc tcaacacatg 2940 aatcatcgctactcctactg taagagagaa gcggaagaac gtgacagcac agagcaggaa 3000 gaggcagggcctgaaatcct ctcgaatgag cacgtgggtg ccagggcgtc tccctcacag 3060 ggcgactcggacgagagaga gagtttgaca agggaagagg atgaagacag tgaaaaagag 3120 gaagaggaggaggataaaga gatggaagaa ttgcaggaag aaaaagaatg tgaaaaacca 3180 caaggggatgaggaagagga ggaggaggag gaagaagtgg aagaagaaga ggtagaagag 3240 gcagagaatgagggagaaga agcaaaaact gaaggtctga tgaaggatga cagggctgaa 3300 agtcaagcaagcagcttagg acaaaaagta ggcgagagta gtgagcaagt gtctgaagaa 3360 aagacaaatgaagcctaatc gtttttctag aaggaaaata aattctaatt gataatgaat 3420 ttcgttcaatattatccttg cttttcatgg aaacacagta acctgtatgc tgtgattcct 3480 gttcactactgtgtgtgtgt gcgcgtgcat tgattactat ccatttcttt agtcaacgct 3540 ctccacttcctgatttctgc tttaaggaaa actgtgaact ttctgcttca tgtatcagtt 3600 ttaaagcatcccaggcaaag atcatctaca gattctagga attctctccc ctgaaatcaa 3660 aacctggagacttttttttc ttattttagt tgagaagttc ataaactgct caaggattag 3720 ttttccaggactctgcggag gaacggcagg aagaacctca gagagggcag aggtgacttc 3780 aaagtgctggggactccgtc ctgagggtca cttggccctg agcccctgcg tgcccttgcg 3840 gaagcccagaagcttcttcc tgctgcacct cccgtttccg ctgctgctga cgtttatgca 3900 tttcatgatggggtccaaca agaacacctg acttgggtga agttgtgcaa tattggaggc 3960 tgactgtagggctgggcagc tgggagacag gctcatggct catggctcat ggctcagggc 4020 ggtgcctgccctgggccggg acccccctcc ccacccccca cctaggcttt ttgggttttg 4080 ttcaaggaaggtaaagtgag aggtttaggt cagtgttttt aagtttttgt ttttttttta 4140 aagcaaatcctgtatatgta tctacatggg agataggtag acactactta tttgttacat 4200 tttgtactatacgtttgtgt tccaggtttc agcttccctc gctcctgttg ttaagaagcg 4260 tccctgtcagcacaggtgtg cattgaggaa ggggccccag ggccttcgct ccctcagcac 4320 tggggtggaggcggcaggaa ggggcggccc ttacctggca ggtctgggcg cacctttagc 4380 aggtggactccgtggggctc caccagccag aagcctctgg aaggcaacga aggcaatgct 4440 gctccctgagtccagtcccc gcccccaaac ccagcccagg tgccttcagc tacttcggct 4500 tcttaaaccctgcagtgtta aacagaggca ttgagaaagg ggaaaggcgg gtatttttaa 4560 aagccaaagattgacccaag ttacttgagg gtagggaggc gggcccagtg caggaggctg 4620 catccctggcctgctggtgc ccaccggggg ctgtgcctgt gccgggccgc aggaagctgg 4680 ctgcccccattcctgctgct gctgctgctg ctgctctgtg gctgtttcaa agactgggcg 4740 aaaggctgtccggagggcag accaggtgcc ttgccgcaga gaaaacacca aagtctcctg 4800 ttcgctcataaagaagtttt tgggatggga gagaatccag accatcttgg ggcagccagg 4860 cccttgccttcatttttaca gaggtagcac aactgattcc aacacaaaac cccttcccct 4920 ttttaaaatgatttctgttc taatgccata gatcaaaggc ctcagaaacc attgtgtgtt 4980 tcctctttgaagcaatgaca agcactttac tttcacggtg gtttttgttt tttcttattg 5040 ctgtggaacctcttttggag gacgttaaag gcgtgtttta cttgtttttt taagagtgtg 5100 tgatgtgtgttttgtagatt tcttgacagt gctgtaatac agacggcaat gcaatagcct 5160 atttaaagaactacgtgatc tgattgagat gtacatagtt ttttttttta ccataactga 5220 attattttatctcttatgtt atcatgagaa atgtatgcca aatgattagt tgatgtatgt 5280 tttttaatttaatatttaaa taaaatattt ggaagg 5316 27 3045 DNA Homo sapiens 27 aattcccttgaggtggtttc acatccacat ccagttgtcc ctaaaatgga gaaagaactg 60 gtgccagaccaggcagtaat atcagacagt actttctctc tggcaaacag tccaggcagt 120 gaatcagtaaccaaggatga cgcactttct tttgtcccct cccagaaaga aaagggaaca 180 gcaactcctgaactacatac agctacagat tatagagatg gcccagatgg aaattcgaat 240 gagcctgatacgcggccact agaagacagg gcagtaggcc tgtccacatc ctccactgct 300 gcagagcttcagcacgggat ggggaatacc agtctcacag gacttggtgg agagcatgag 360 ggtcccgcccctccagcaat cccagaagct ctgaatatca aggggaacac tgactcttcc 420 gtgcaaagtgtgggtaaggc cactttggct ttagattcag ttttgactga agaaggaaaa 480 gttctggtggtttcagaaag ctctgcagct caggaacaag ataaggataa agcggtgacc 540 tgttcctctattaaggaaaa tgctctctct tcaggaactt tgcaggaaga gcagagaaca 600 ccacctcctggacaagatac tcaacaattt catgaaaaat caatctcagc tgactgtgcc 660 aaggacaaagcacttcagct aagtaattca ccgggtgcat cctctgcctt tcttaaggca 720 gaaactgaacataacaagga agtggcccca caagtctcac tgctgactca aggtggggct 780 gcccagagcctggtgccacc aggagcaagt ctggccacag agtcaaggca ggaagccttg 840 ggggcagagcacaacagctc cgctctgttg ccatgtctgt tgccagatgg gtctgatggg 900 tccgatgctcttaactgcag tcagccttct cctctggatg ttggagtgaa gaacactcaa 960 tcccagggaaaaactagtgc ctgtgaggtg agtggagatg tgacggtgga tgttacaggg 1020 gttaatgctctacaaggtat ggctgagccc agaagagaga atatatcaca caacacccaa 1080 gacatcctgattccaaacgt cttgttgagc caagagaaga atgccgttct aggtttgcca 1140 gtggctctacaggacaaagc tgtgactgac ccacagggag ttggaacccc agagatgata 1200 cctcttgattgggagaaagg gaagctggag ggagcagacc acagctgtac catgggtgac 1260 gctgaggaagcccaaataga cgatgaagca catcctgtcc tactgcagcc tgttgccaag 1320 gagctccccacagacatgga gctctcagcc catgatgatg gggccccagc tggtgtgagg 1380 gaagtcatgcgagccccgcc ttcaggcagg gaaaggagca ctccctctct accttgcatg 1440 gtctctgcccaggacgcacc tctgcctaag ggggcagact tgatagagga ggctgccagc 1500 cgtatagtggatgctgtcat cgaacaagtc aaggccgctg gagcactgct tactgagggg 1560 gaggcctgtcacatgtcact gtccagccct gagttgggtc ctctcactaa aggactagag 1620 agtgcttttacagaaaaagt gagtactttc ccacctgggg agagcctacc aatgggcagt 1680 actcctgaggaagccacggg gagccttgca ggatgttttg ctggaaggga ggagccagag 1740 aagatcattttacctgtcca ggggcctgag ccagcagcag aaatgccaga cgtgaaagct 1800 gaagatgaagtggattttag agcaagttca atttctgaag aagtggctgt agggagcata 1860 gctgctacactgaagatgaa gcaaggccca atgacccagg cgataaaccg agaaaactgg 1920 tgtacaatagagccatgccc tgatgcagca tctcttctgg cttccaagca gagcccagaa 1980 tgtgagaacttcctggatgt tggactgggc agagagtgta cctcaaaaca aggtgtactt 2040 aaaagagaatctgggagtga ttctgacctc tttcactcac ccagtgatga catggacagc 2100 atcatcttcccaaagccaga ggaagagcat ttggcctgtg atatcaccgg atccagttca 2160 tccaccgatgacacggcttc actggaccga cattcttctc atggcagtga tgtgtctctc 2220 tcccagattttaaagccaaa caggtcaaga gatcggcaaa gccttgatgg attctacagc 2280 catgggatgggagctgaggg tcgagaaagt gagagtgagc ctgctgaccc aggcgacgtg 2340 gaggaggaggagatggacag tatcactgaa gtgcctgcaa actgctctgt cctaaggagc 2400 tccatgcgctctctttctcc cttccggagg cacagctggg ggcctgggaa aaatgcagcc 2460 agcgatgcagaaatgaacca ccggagttca atgcgagttc ttggggatgt tgtcaggaga 2520 cctcccattcataggagaag tttcagtcta gaaggcttga caggaggagc tggtgtcgga 2580 aacaagccatcctcatctct agaagtaagc tctgcaaatg ccgaagagct cagacaccca 2640 ttcagtggtgaggaacgggt tgactctttg gtgtcacttt cagaagagga tctggagtca 2700 gaccagagagaacataggat gtttgatcag cagatatgtc acagatctaa gcagcaggga 2760 tttaattactgtacatcagc catttcctct ccattgacaa aatccatctc attaatgaca 2820 atcagccatcctggattgga caattcacgg cccttccaca gtaccttcca caataccagt 2880 gctaatctgactgagagtat aacagaagag aactataatt tcctgccaca tagcccctcc 2940 aagaaagattctgaatggaa gagtggaaca aaagtcagtc gtacattcag ctacatcaag 3000 aataaaatgtctagcagcaa gaagagcaaa gaaaagaaaa aaaag 3045 28 3634 DNA Homo sapiens 28tcaacacagg acaatgcaag cccatgagct gttccggtat tttcgaatgc cagagctggt 60tgacttccga cagtacgtgc gtactcttcc gaccaacacg cttatgggct tcggagcttt 120tgcagcactc accaccttct ggtacgccac gagacccaaa cccctgaagc cgccatgcga 180cctctccatg cagtcagtgg aagtggcggg tagtggtggt gcacgaagat ccgcactact 240tgacagcgac gagcccttgg tgtatttcta tgatgatgtc acaacattat acgaaggttt 300ccagagggga atacaggtgt caaataatgg cccttgttta ggctctcgga aaccagacca 360accctatgaa tggctttcat ataaacaggt tgcagaattg tcggagtgca taggctcagc 420actgatccag aagggcttca agactgcccc agatcagttc attggcatct ttgctcaaaa 480tagacctgag tgggtgatta ttgaacaagg atgctttgct tattcgatgg tgatcgttcc 540actttatgat acccttggaa atgaagccat cacgtacata gtcaacaaag ctgaactctc 600tctggttttt gttgacaagc cagagaaggc caaactctta ttagagggtg tagaaaataa 660gttaatacca ggccttaaaa tcatagttgt catggatgcc tacggcagtg aactggtgga 720acgaggccag aggtgtgggg tggaagtcac cagcatgaag gcgatggagg acctgggaag 780agccaacaga cggaagccca agcctccagc acctgaagat cttgcagtaa tttgtttcac 840aagtggaact acaggcaacc ccaaaggagc aatggtcact caccgaaaca tagtgagcga 900ttgttcagct tttgtgaaag caacagagaa tacagtcaat ccttgcccag atgatacttt 960gatatctttc ttgcctctcg cccatatgtt tgagagagtt gtagagtgtg taatgctgtg 1020tcatggagct aaaatcggat ttttccaagg agatatcagg ctgctcatgg atgacctcaa 1080ggtgcttcaa cccactgtct tccccgtggt tccaagactg ctgaaccgga tgtttgaccg 1140aattttcgga caagcaaaca ccacgctgaa gcgatggctc ttggactttg cctccaagag 1200gaaagaagca gagcttcgca gcggcatcat cagaaacaac agcctgtggg accggctgat 1260cttccacaaa gtacagtcga gcctgggcgg aagagtccgg ctgatggtga caggagccgc 1320cccggtgtct gccactgtgc tgacgttcct cagagcagcc ctgggctgtc agttttatga 1380aggatacgga cagacagagt gcactgccgg gtgctgccta accatgcctg gagactggac 1440cgcaggccat gttggggccc cgatgccgtg caatttgata aaacttgttg atgtggaaga 1500aatgaattac atggctgccg agggcgaggg cgaggtgtgt gtgaaagggc caaatgtatt 1560tcagggctac ttgaaggacc cagcgaaaac agcagaagct ttggacaaag acggctggtt 1620acacacaggg gacattggaa aatggttacc aaatggcacc ttgaaaatta tcgaccggaa 1680aaagcacata tttaagctgg cacaaggaga atacatagcc cctgaaaaga ttgaaaatat 1740ctacatgcga agtgagcctg ttgctcaggt gtttgtccac ggagaaagcc tgcaggcatt 1800tctcattgca attgtggtac cagatgttga gacattatgt tcctgggccc aaaagagagg 1860atttgaaggg tcgtttgagg aactgtgcag aaataaggat gtcaaaaaag ctatcctcga 1920agatatggtg agacttggga aggattctgg tctgaaacca tttgaacagg tcaaaggcat 1980cacattgcac cctgaattat tttctatcga caatggcctt ctgactccaa caatgaaggc 2040gaaaaggcca gagctgcgga actatttcag gtcgcagata gatgacctct attccactat 2100caaggtttag tgtgaagaag aaagctcaga ggaaatggca cagttccaca atctcttctc 2160ctgctgatgg ccttcatgtt gttaattttg aatacagcaa gtgtagggaa ggaagcgttc 2220gtgtttgact tgtccattcg gggttcttct cataggaatg ctagaggaaa cagaacaccg 2280ccttacagtc acctcatgtt gcagaccatg tttatggtaa tacacacttt ccaaaatgag 2340ccttaaaaat tgtaaagggg atactataaa tgtgctaagt tatttgagac ttcctcagtt 2400taaaaagtgg gttttaaatc ttctgtctcc ctgcttttct aatcaagggg ttaggacttt 2460gctatctctg agatgtctgc tacttgctgc aaattctgca gctgtctgct gctctaaaga 2520gtacagtgca ctagagggaa gtgttccctt taaaaataag aacaactgtc ctggctggag 2580aatctcacaa gcggaccaga gatcttttta aatccctgct actgtccctt ctcacaggca 2640ttcacagaac ccttctgatt cgtaagggtt acgaaactca tgttcttctc cagtcccctg 2700tggtttctgt tggagcataa ggtttccagt aagcgggagg gcagatccaa ctcagaacca 2760tgcagataag gagcctctgg caaatgggtg ctcatcagaa cgcgtggatt ctctttcatg 2820gcagaatgct cttggactcg gttctccagg cctgattccc cgactccatc ctttttcagg 2880ggttatttaa aaatctgcct tagattctat agtgaagaca agcatttcaa gaaagagtta 2940cctggatcag ccatgctcag ctgtgacgcc tgaataactg tctactttat cttcactgaa 3000ccactcactc tgtgtaaagg ccaacagatt tttaatgtgg ttttcatatc aaaagatcat 3060gttgggatta acttgccttt ttccccaaaa aataaactct caggcaagca tttctttaaa 3120gctattaagg gagtatatac ttgagtactt attgaaatgg acagtaataa gcaaatgttc 3180ttataatgct acctgatttc tatgaaatgt gtttgacaag ccaaaattct aggatgtaga 3240aatctggaaa gttcatttcc tgggattcac ttctccaggg attttttaaa gttaatttgg 3300gaaattaaca gcagttcact ttattgtgag tctttgccac atttgactga attgagctgt 3360catttgtaca tttaaagcag ctgttttggg gtctgtgaga gtacatgtat tatatacaag 3420cacaacaggg cttgcactaa agaattgtca ttgtaataac actacttggt agcctaactt 3480catatatgta ttcttaattg cacaaaaagt caataatttg tcaccttggg gttttgaatg 3540tttgctttaa gtgttggcta tttctatgtt ttataaacca aaacaaaatt tccaaaaaca 3600atgaaggaaa ccaaaataaa tatttctgca tttc 3634 29 4573 DNA Homo sapiens 29cgcgtgtcta cgcggacgca ccggctaagc tgcttctgcc gccgccggcc gcctgggacc 60ttgcggtgag gctgcgcggg gccgaggccg cctccgagcg ccaggtttat tcagtcacca 120tgaagctgct gctgctgcac ccggccttcc agagctgcct cctgctgacc ctgcttggct 180tatggagaac cacccctgag gctcacgctt catccctggg tgcaccagct atcagcgctg 240cctccttcct gcaggatcta atacatcggt atggcgaggg tgacagcctc actctgcagc 300agctgaaggc cctgctcaac cacctggatg tgggagtggg ccggggtaat gtcacccagc 360acgtgcaagg acacaggaac ctctccacgt gctttagttc tggagacctc ttcactgccc 420acaatttcag cgagcagtcg cggattggga gcagcgagct ccaggagttc tgccccacca 480tcctccagca gctggattcc cgggcctgca cctcggagaa ccaggaaaac gaggagaatg 540agcagacgga ggaggggcgg ccaagcgctg ttgaagtgtg gggatacggt ctcctctgtg 600tgaccgtcat ctccctctgc tccctcctgg gggccagcgt ggtgcccttc atgaagaaga 660ccttttacaa gaggctgctg ctctacttca tagctctggc gattggaacc ctctactcca 720acgccctctt ccagctcatc ccggaggcat ttggtttcaa ccctctggaa gattattatg 780tctccaagtc tgcagtggtg tttgggggct tttatctttt ctttttcaca gagaagatct 840tgaagattct tcttaagcag aaaaatgagc atcatcatgg acacagccat tatgcctctg 900agtcgcttcc ctccaagaag gaccaggagg agggggtgat ggagaagctg cagaacgggg 960acctggacca catgattcct cagcactgca gcagtgagct ggacggcaag gcgcccatgg 1020tggacgagaa ggtcattgtg ggctcgctct ctgtgcagga cctgcaggct tcccagagtg 1080cttgctactg gctgaaaggt gtccgctact ctgatatcgg cactctggcc tggatgatca 1140ctctgagcga cggcctccac aatttcatcg atggcctggc catcggtgct tccttcactg 1200tgtcagtttt ccaaggcatc agcacctcgg tggccatcct ctgtgaggag ttcccacatg 1260agctaggaga ctttgtcatc ctgctcaacg ctgggatgag catccaacaa gctctcttct 1320tcaacttcct ttctgcctgc tgctgctacc tgggtctggc ctttggcatc ctggccggca 1380gccacttctc tgccaactgg atttttgcgc tagctggagg aatgttcttg tatatttctc 1440tggctgatat gttccctgag atgaatgagg tctgtcaaga ggatgaaagg aagggcagca 1500tcttgattcc atttatcatc cagaacctgg gcctcctgac tggattcacc atcatggtgg 1560tcctcaccat gtattcagga cagatccaga ttgggtaggg ctctgccaag agcctgtggg 1620actggaagtc gggccctggg ctgcccgatc gccagcccga ggacttacca tccacaatgc 1680accacggaag aggccgttct atgaaaaact gacacagact gtattcctgc attcaaatgt 1740cagccgtttg taaaatgctg tatcctagga ataagctgcc ctggtaacca gtctctagct 1800agtgcctctt gccctctcct cacctccttt tctctcagtg actctggaac ctgaatgcag 1860cttacaagac aagcctgact tttttctctg attaccttgg cctcctcttg gaaccagtgc 1920tgaaaggttt tgaatccttt acccaacaat gcaaaaatag agccaatggt tataacttgg 1980ctagaaatat caagagttga atccatagtg tggggcccat gactctagct gggcaccttg 2040gacctccagc tggccaatag aagagacagg agacaggaag ccttcccatt ttttcaaagt 2100ctgtttaatt gcctattact tctctcaaag agaacctgaa gtcagaacac atgagcaggg 2160tgagaggtga ggcaaggttc atcctgaatg ggagaggaag tcgaaccact gctgtgtgtc 2220ttgtcaggat gctcacttgt tcctactgag atgctggata ttgattttgt aacagcacct 2280ggtgtttcac ggctgtccga gtgagctaac gtggcggtgt ggctgcctgg acctcctctt 2340tcaggttaac gctgacagaa tggaggctca ggctgtctgc aagaaaacag ttggtttggc 2400tgtgattttg acctcctctt ccccactgcc atcttctaag agactttgta gctgcctcct 2460agaagcacat tctgagcaca tttgagacct ctgtgttaga ggggagactg cacaaactat 2520cctcccccag gttgagacgt ctgcagagtg gcaagctgac ttgtagaaat ggggtgccat 2580ttatgctcta cttagacaag ggtaatcaga aatggaatca gtgcaggcaa aatttaggat 2640ttgccgcttc cataaatcaa agcatgacta atagggggtc tctgaaatgt aagggcacaa 2700acttcactta gggcatcgca gatgtttgca gaatggttgg cctaatgatt atgctacaga 2760tgggttttaa atgacccgtc taggttactg cttccttgca aaaaaagtcg aatcctgcat 2820tgaattgaat atgaatttct ctaactctct ccagaaaatg gatggagata acttgtcttt 2880aaaactgtag gccagcctta gccactgtgg agcccttgcc tccgagctct ggcttcaagg 2940ggagctcttc tccaggttca ctaggtgaat tgatttatta ttatcatatt gataatgtga 3000gattctttag ccactttggg gagcctgtct ctccagaagc ctttcttagt ggtgcccaca 3060gttggagccc aggggccatg tttgcaaact gattcatgtg catggctgac aggagtactg 3120gttcactacc aatgcctgag cttttctctt acatagaaaa actgtccact ctcagtaatc 3180acaagcagca tccgttttgt tttctcttct tgggagacat ctgtcaaacc aggaatattc 3240ttgaaaagaa cgtgagcagg aaaaactgct ggtgatactt tttttaagtt ttgtttttat 3300cttgcctgtt ggcttcaata catttgagaa tacgctgaag agggaaaatt tcagtgatgg 3360agattctaga ttaaatatca ggactgattt cctggtggga ttatggtcca gttttaccaa 3420agaaccaatt ccttgaatgt tggaatctaa ctttttatat tgtcattatt attgttgttt 3480ttaaacggtt ctttgtcttt tctgttttat ttttctcaag ctgctttcag gagctagcag 3540aaaataactc aaagttgaag actctggaag attttgcttt aacctaactc gcattgatgt 3600attaaattta taattttagc attcccaata gatcctatca ttccttaaac ataataccct 3660ttgtcttgga gtagaatact aagttagagt tagtggattt ctagtttagg agaggagctc 3720aaaactataa tctttaacaa attgaaaaat gaaatagggt gttttccctt tttgtgcaca 3780cctatattac cttaagaaat ttccttccat agacagctgc ctcaaaggga aatcctcttt 3840aaaccgtagt tggcgcagag gtcagtccta gtcggagctt aggaggggcg gagacgctca 3900catcgtctga cttgagtcgc cactgattgt ggcaacagct ttgcctcatg agtcaaaaat 3960tggcaatttc ttttgatttt tagttgttga atttgctgtt tcaagcattt gtacatatta 4020gaagtctaag gagtagcaag tcagtgggag gactttttca cccctggcat tagcagcttc 4080gacctcattt tccagatgca ccagctccta ttaataagtt agcaaggaaa gtgtatgtca 4140cgtgcaggaa cagtgaggca gggacagggg ttctgctcct tctcacttca ccaccggcac 4200acagcttgcc cctgtctttg cccccaaagg tattttgtgt ctagtgtcaa attggagcta 4260ttcttcactg gtccttaacc ttgggtttta aaaagaaggc ttctctgttt gggtagcgta 4320agagctgagt atagtaagtc ctcttccaaa gagatggcaa tatgctgggc atctacttta 4380aaacaaagtt gtctgatttt tgcaagagag gttaggattt tattgttctt atttcccttt 4440acagttctgc agttccatca cagtattttt ttaaataact caggtgtatg agcagaaatt 4500agaaaagaaa attaacttat gtggactgta aatgttttat ttgtaagatt ctataaataa 4560agctatattc tgt 4573 30 1707 DNA Homo sapiens 30 cggcgctggg ctgaggggaggggttgtctt aaaagtctct ccttccccct gtaggggcgg 60 ccggcgagtc ccagtgagagcggagggtgc cagaggtagg gggccgagaa acaaagttcc 120 cggggcttcc tccggggccgcggtcggggc tgcgcgtttg accgcccccc tcctcgcgaa 180 gcaatggctt ccaaactcctgcgcgcggtc atcctcgggc cgcccggctc gggcaagggc 240 accgtgtgcc agaggatcgcccagaacttt ggtctccagc atctctccag cggccacttc 300 ttgcgggaga acatcaaggccagcaccgaa gttggtgaga tggcaaagca gtatatagag 360 aaaagtcttt tggttccagaccatgtgatc acacgcctaa tgatgtccga gttggagaac 420 aggcgtggac agcactggctccttgatggt tttcctagga cattaggaca agccgaagcc 480 ctggacaaaa tctgtgaagtggatctagtg atcagtttga atattccatt tgaaacactt 540 aaagatcgtc tcagccgccgttggattcac cctcctagcg gaagggtata taacctggac 600 ttcaatccac ctcatgtacatggtattgat gacgtcactg gtgaaccgtt agtccagcag 660 gaggatgata aacccgaagcagttgctgcc aggctaagac agtacaaaga cgtggcaaag 720 ccagtcattg aattatacaagagccgagga gtgctccacc aattttccgg aacggagacg 780 aacaaaatct ggccctacgtttacacactt ttctcaaaca agatcacacc tattcagtcc 840 aaagaagcat attgaccctgcccaatggaa gaaccaggaa gatgtggtca ttcattcaat 900 agtgtgtgta gtattggtgctgtgtccaaa ttagaagcta gctgaggtag cttgcagcat 960 cttttctagt tgaaatggtgaactgatagg aaaacaaatg agtagaaaga gttcatgaag 1020 aggccctcct ctgcctttcaaaaggctggt cacctacaca tgtttaaggt gtctctgcac 1080 atgtctcaag cccatcacaagaaagcaagt acagtgtgga tttcaaatgg tgtgtaactt 1140 cagctccagc tggtttttgacagctgttgc tgtggtaata tttttgacat gtgatggtga 1200 tagtctctgg ttctccccatccccacaaag gctgttgaac cacagcacca ggaagcctga 1260 gaatgaatcc tgagggctctagcccaggct ttgtcccagg ctttctggtg tgtgccctcc 1320 tggtaacagt gaaattgaagctacttactc atagtggttg tttctctggt cttgagtgac 1380 tgtgtccaca gttcatttttttccggtagg aataactcct tttctacatc cacgctccat 1440 agagtctctc cttttcagacatcctgggat gaaagaattt ggcttttttt tttctttttt 1500 ttttggacat ctgttttcactcttaggctt ttaaacaata gttattgctt ttatccctct 1560 cagattctaa taactgagagcgatggggct atattgaatc tctgtatgca ctgagaactg 1620 agctatgaag agaatcttattaaactgctg gtctgacttt atggattgac actgttcctt 1680 tcttttattg tgaaaaaaaaaaaaaaa 1707 31 2916 DNA Homo sapiens misc_feature (1)..(2916) n = a, c,g or t 31 agcagagctt tcccnccatg nnagaagctt catgagtcac acattacatctttgggttga 60 ttgaatgcca ctgaaacatt tctagtagcc tggagnagtt gacctacctgtggagatgcc 120 tgccattaaa tggcatcctg atggcttaat acacatcact cttctgtgnagggttttaat 180 tttcaacaca gcttactctg tagcatcatg tttacattgt atgtataaagattatacnaa 240 ggtgcaattg tgtatttctt ccttaaaatg tatcagtata ggatttagaatctccatgtt 300 gaaactctaa atgcatagaa ataaaaataa taaaaaattt ttcattttgccttttcagcc 360 tagtattaaa actgataaaa gcaaagccat gcacaaaact acctccctagagaaaggcta 420 gtcccttttc ttccccattc atttcattat gaacatagta gaaaacagcatattcttatc 480 aaatttgatg aaaagcgcca acacgtttga actgaaatac gacttgtcatgtgaactgta 540 ccgaatgtct acgtattcca cttttcctgc tggggttcct gtctcagaaaggagtcttgc 600 tcgtgctggt ttctattaca ctggtgtgaa tgacaaggtc aaatgcttctgttgtggcct 660 gatgctggat aactggaaaa gaggagacag tcctactgaa aagcataaaaagttgtatcc 720 tagctgcaga ttcgttcaga gtctaaattc cgttaacaac ttggaagctacctctcagcc 780 tacttttcct tcttcagtaa cacattccac acactcatta cttccgggtacagaaaacag 840 tggatatttc cgtggctctt attcaaactc tccatcaaat cctgtaaactccagagcaaa 900 tcaagaattt tctgccttga tgagaagttc ctacccctgt ccaatgaataacgaaaatgc 960 cagattactt acttttcaga catggccatt gacttttctg tcgccaacagatctggcacg 1020 agcaggcttt tactacatag gacctggaga cagagtggct tgctttgcctgtggtggaaa 1080 attgagcaat tgggaaccga aggataatgc tatgtcagaa cacctgagacattttcccaa 1140 atgcccattt atagaaaatc agcttcaaga cacttcaaga tacacagtttctaatctgag 1200 catgcagaca catgcagccc gctttaaaac attctttaac tggccctctagtgttctagt 1260 taatcctgag cagcttgcaa gtgcgggttt ttattatgtg ggtaacagtgatgatgtcaa 1320 atgcttttgc tgtgatggtg gactcaggtg ttgggaatct ggagatgatccatgggttca 1380 acatgccaag tggtttccaa ggtgtgagta cttgataaga attaaaggacaggagttcat 1440 ccgtcaagtt caagccagtt accctcatct acttgaacag ctgctatccacatcagacag 1500 cccaggagat gaaaatgcag agtcatcaat tatccatttg gaacctggagaagaccattc 1560 agaagatgca atcatgatga atactcctgt gattaatgct gccgtggaaatgggctttag 1620 tagaagcctg gtaaaacaga cagttcagag aaaaatccta gcaactggagagaattatag 1680 actagtcaat gatcttgtgt tagacttact caatgcagaa gatgaaataagggaagagga 1740 gagagaaaga gcaactgagg aaaaagaatc aaatgattta ttattaatccggaagaatag 1800 aatggcactt tttcaacatt tgacttgtgt aattccaatc ctggatagtctactaactgc 1860 cggaattatt aatgaacaag aacatgatgt tattaaacag aagacacagacgtctttaca 1920 agcaagagaa ctgattgata cgattttagt aaaaggaaat attgcagccactgtattcag 1980 aaactctctg caagaagctg aagctgtgtt atatgagcat ttatttgtgcaacaggacat 2040 aaaatatatt cccacagaag atgtttcaga tctaccagtg gaagaacaattgcggagact 2100 accagaagaa agaacatgta aagtgtgtat ggacaaagaa gtgtccatagtgtttattcc 2160 ttgtggtcat ctagtagtat gcaaagattg tgctccttct ttaagaaagtgtcctatttg 2220 taggagtaca atcaagggta cagttcgtac atttctttca tgaagaagaaccaaaacatc 2280 gtctaaactt tagaattaat ttattaaatg tattataact ttaacttttatcctaatttg 2340 gtttccttaa aatttttatt tatttacaac tcaaaaaaca ttgttttgtgtaacatattt 2400 atatatgtat ctaaaccata tgaacatata ttttttagaa actaagagaatgataggctt 2460 ttgttcttat gaacgaaaaa gaggtagcac tacaaacaca atattcaatccaaatttcag 2520 cattattgaa attgtaagtg aagtaaaact taagatattt gagttaacctttaagaattt 2580 taaatatttt ggcattgtac taataccggg aacatgaagc caggtgtggtggtatgtacc 2640 tgtagtccca ggctgaggca agagaattac ttgagcccag gagtttgaatccatcctggg 2700 cagcatactg agaccctgcc tttaaaaacn aacagnacca aanccaaacaccagggacac 2760 atttctctgt cttttttgat cagtgtccta tacatcgaag gtgtgcatatatgttgaatc 2820 acattttagg gacatggtgt ttttataaag aattctgtga gnaaaaatttaataaagcaa 2880 ccnaaattac tcttaaaaaa aaaaaaaaaa aaaaaa 2916 32 3188 DNAHomo sapiens 32 cgggcagtga cagccggcgc ggatcgcgcg tccacggagg agaatcagcttagagaacta 60 tcaacacagg acaatgcaag cccatgagct gttccggtat tttcgaatgccagagctggt 120 tgacttccga cagtgcgtga ctcttccgac caacacgctt atgggcttcggagctttttc 180 cagacgactc accaccttct ggcggccacg ccacccaaaa cccctgaagccgccatggca 240 cctctccatg cagtcagtgg aagtggcggg tagtggtggt gcacgaagatccgcactact 300 tgacagcgac gagcccttgg tgtatttcta tgatgatgtt acaacattatacgaaggttt 360 ccagagaggg atacaggtgt caaataatgg cccttgttta ggctctcggaaaccagacca 420 accctatgaa tggctttcat ataaacaggt tgcagaattg tcggagtgcataggctcagc 480 actgatccag aagggcttca agactgcccc agatcagttc attggcatctttgctcaaaa 540 tagacctgag tgggtgatta ttgaacaagg atgctttgct tattcgatggtgatcgttcc 600 actttatgat acccttggaa atgaagccat cacgtacata gtcaacaaagctgaactctc 660 tctggttttt gttgacaagc cagagaaggc caaactctta ttagagggtgtagaaaataa 720 gttaatacca ggccttaaaa tcatagttgt catggactcg tacggcagtgaactggtgga 780 acgaggccag aggtgtgggg tggaagtcac cagcatgaag gcgatggaggacctgggaag 840 agccaacaga cggaagccca agcctccagc acctgaagat cttgcagtaatttgtttcac 900 aagtggaact acaggcaacc ccaaaggagc aatggtcact caccgaaacatagtgagcga 960 ttgttcagct tttgtgaaag caacagagaa tacagtcaat ccttgcccagatgatacttt 1020 gatatctttc ttgcctctcg cccatatgtt tgagagagtt gtagagtgtgtaatgctgtg 1080 tcatggagct aaaatcggat ttttccaagg agatatcagg ctgctcatggatgacctcaa 1140 ggtgcttcaa cccactgtct tccccgtggt tccaagactg ctgaaccggatgtttgaccg 1200 aattttcgga caagcaaaca ccaccgtgaa gcgatggctc ttggactttgcctccaagag 1260 gaaagaagca gacgttcgca gcggcatcat cagaaacaac agcctgtgggaccggctgat 1320 cttccacaaa gtacagtcga gcctgggcgg aagagtccgg ctgatggtgacaggagccgc 1380 cccggtgtct gccactgtgc tgacgttcct cagagcagcc ctgggctgtcagttttatga 1440 aggatacgga cagacagagt gcactgccgg gtgctgccta accatgcctggagactggac 1500 cacaggccat gttggggccc cgatgccgtg caatttgata aaacttggttggcagttgga 1560 agaaatgaat tacatggcgt ccgagggcga gggcgaggtg tgtgtgaaagggccaaatgt 1620 atttcagggc tacttgaagg acccagcgaa aacagcagaa gctttggacaaagacggctg 1680 gttacacaca ggggacatcg gaaaatggtt accaaatggc accttgaaaattatcgaccg 1740 gaaaaagcac atatttaagc tggcacaagg agaatacata gcccctgaaaagattgaaaa 1800 tatctacatg cgaagtgagc ctgttgctca ggtgtttgtc cacggagaaagcctgcaggc 1860 atttctcatt gcaattgtgg taccagatgt tgagacatta tgttcctgggcccaaaagag 1920 aggatttgaa gggtcgtttg aggaactgtg cagaaataag gatgtcaaaaaagctatcct 1980 cgaagatatg gtgagacttg ggaaggattc tggtctgaaa ccatttgaacaggtcaaagg 2040 catcacattg caccctgaat tattttctat cgacaatggc cttctgactccaacaatgaa 2100 ggcgaaaagg ccagagctgc ggaactattt caggtcgcag atagatgacctctattccat 2160 catcaaggtt tagtgtgaag aagaaagctc agaggaaatg gcacagttccacaatctctt 2220 ctcctgctga tggccttcat gttgttaatt ttgaatacag caagtgtagggaaggaagcg 2280 ttctgtgttt gacttgtcca ttcggggttc ttctcatagg aatgctagaggaaacagaac 2340 actgccttac agtcacctca gtgttcagac catgtttatg gtaatacacacttccaaaag 2400 tagccttaaa aattgtaaag ggatactata aatgtgctaa ttatttgagacttcctcagt 2460 ttaaaaagtg ggttttaaat cttctgtctc cctgtttttc taatcaaggggttaggactt 2520 tgctatctct gagatgtctg ctacttcgtc gaaattctgc agctgtctgctgctctaaag 2580 agtacagtgc tctagaggga agtgttccct ttaaaaataa gaacaactgtcctggctgga 2640 gatctcacaa gcggaccaga gatcttttta aatccctgct actgtcccttctcacaggca 2700 ttcacagaac ccttctgatt cgaagggtta cgaaactcat gttcttctccagtcccctgt 2760 ggtttctgtt ggagcataag gtttccagta agcgggaggg cagatccaactcagaaccat 2820 gcagataagg agcctctggc aaatgggtgc tgcatcagaa cgcgtggattctctttcatg 2880 gcagatgctc ttggactcgg ttctccaggc ctgattcccc gactccatcctttttcaggg 2940 ttatttaaaa atctgcctta gattctatag tgaagacaag catttcaagaaagagttacc 3000 tggatcagcc atgctcagct gtgacgcctg ataactgtct actttatcttcactgaacca 3060 ctcactctgt gtaaaggcca acggattttt aatgtggttt tcatatcaaaagatcatgtt 3120 gggattaact tgcctttttc cccaaaaaat aaactctcag gcaaggcatttcttttaaag 3180 ctattccg 3188 33 1342 DNA Homo sapiens 33 tcccccactctcaaggatgc tgtgaggggt attcctccca tgtggtgart tgggaggwtt 60 tcctgaggtccttttccatc ctgagacgct ggttttccat tttgtttctc acaggccagg 120 gctttgaccgacacttgttt gctctgcggc atctggcagc agccaaaggg atcatcttgc 180 ctgagctctacctggaccct gcatacgggc agataaacca caatgtcctg tccacgagca 240 cactgagcagcccagcagtg aaccttgggg gctttgcccc tgtggtctct gatggctttg 300 gtgttgggtatgctgttcat gacaactgga taggctgcaa tgtctcttcc tacccaggcc 360 gcaatgcccgggagtttctc caatgtgtgg agaaggcctt agaagacatg tttgatgcct 420 tagaaggcaaatccatcaaa agttaacttc tgggcagatg aaaagctacc atcacttcct 480 catcatgaaaactgggaggc cgggcatggt ggctcatgcc tgtaatccca gcattttgag 540 aggctgaggcgggtggatca cttgaggtca ggagtttgag accaacctgg ccaacatggt 600 gaaaccttgtctctactaaa aatacaagaa ttagctgggt gtggtggcat gtgcctatat 660 cccagctactgggaggttga agcagaattg cttgaaccca ggaggtggag gttgcagtga 720 gctgagatcacaccactgca ctccggcctg ggcgacagag cgagactgtc tcaaaaagac 780 aaaaaagaaaaaaaactggg gcctgtgtag ccagtgggtg ctattctgtg aaactaatca 840 taagctgcctaggcagccag ctacaggctt gagctttaaa ttcatggttt taaagctaaa 900 cgtaatttccacttgggact agatcacaac tgaagrtaac aagagattta agttttaagg 960 gcatttaatcaggaggaaag gtttggaaaa ctaactcagg tgtatttatt gtttaagcag 1020 aaataaagtttaatttttgc ttgaagatgg ttcttaattt cttttaacct aattcctaat 1080 cctcacaaagatctttccaa cagcaagttc agtaagttca ggtaacagta cgtcaccatt 1140 ggcttctggctcattgagtg atggtgggat cgcggtttca tctctgtaaa cttgcccttg 1200 actggggagataccatctcc ttaaaaatac tcttcatttc tcctaaggag tgaactsctg 1260 ctgcacgaattcttatttgt ggagggagta gcttgctccc ttactttcac cycccatgca 1320 accagtgcagggtkaacagg gg 1342 34 4859 DNA Homo sapiens 34 cacgttgggt gacataatggggttttttta attatagatt cacactgcat ttattcatca 60 cccctgtcct ctcatccataactcaaattt actaccagca acacaaaata caaagatgtg 120 tccagtttca ctacagctcttcgcgtttac aagtgtcgag cgcttgcttt cggaacgccc 180 ttgtgattgg ccgagccaatgccagtgaca tcaaccaact tacttttgat tggaaggctg 240 gttgctggga ctgtagcgtttgcaggaagt cacttaactg tttgggagct ggaaaaccga 300 agctgaagtt ctcttttgccataggaacga gcgcaactga ctaggaaaga tgtgtcccaa 360 agctccgcaa gctggaacgtgagccaggag gcccggaccg gccacgggac cgcgaggcac 420 tccgaaagtg tgcggctgccccttccctgc ctcccagctg ttaccctttt aaatgtcagt 480 gttcgaggct gtaggggtagcacgaggcag cgaaacggaa cagtcggatt ggccgcacgc 540 ctcagttcta gacgcacctctccaccgaag ccgttctgac tggcaggggg agaaagtaaa 600 cagagttgaa tcaccctccccactggccaa ttggaggggg tttggtttgt gacgtgatgg 660 gattctgcga aattgttactgagcaagaga atgccggaac gtgcggaccg gccggagcag 720 gggttcagaa gccgtcagtggactcgggaa aaagtgtctc ttagacctgg cgctcggcgg 780 ggccctcgcc acccgcgtcggggtgatcgg gtgaatgtcc tggggctttg gctcgacggc 840 gaggcggccg agggcgtgcacctctcttgc agtttcctct cccagcgcct cgggggcgtt 900 ttcagtcgaa taaacttgcgaccgccacgt gtggcatctt tccaagggag ccggctcaga 960 ggggccggcg cgcccgtcgggggatcgcgg ccggcgcggg gcaggggcgg cggctagagg 1020 cggcggcgcg gcggagcccggggccgtgga tgctgcgtgc ggaggcgctg ccggttacgt 1080 aaagatgagg ggctgaggtcgcctcggcgc tcctgcgagt cggaagcgcc ccgcgccccc 1140 gcccccttgg ccgccgcgccgtgccgggcg ggcgggtcgt cgtccgaggc cagggagggc 1200 gagccgaacc tccgcagccaccgccaagtt tgtccgcgcc gcctgggctg ccgtcgcccg 1260 caccatgtcc gcggccgcctacatggactt cgtggctgcc cagtgtctgg tttccatttc 1320 gaaccgcgct gcggtgccggagcatggggt cgctccggac gccgagcggc tgcgactacc 1380 tgagcgcgag gtgaccaaggagcacggtga cccgggggac acctggaagg attactgcac 1440 actggtcacc atcgccaagagcttgttgga cctgaacaag taccgaccca tccagacccc 1500 ctccgtgtgc agcgacagtctggaaagtcc agatgaggat atgggatccg acagcgacgt 1560 gaccaccgaa tctgggtcgagtccttccca cagcccggag gagagacagg atcctggcag 1620 cgcgcccagc ccgctctccctcctccatcc tggagtggct gcgaagggga aacacgcctc 1680 cgaaaagagg cacaagtgcccctacagtgg ctgtgggaaa gtctatggaa aatcctccca 1740 tctcaaagcc cattacagagtgcatacagg tgaacggccc ttcccctgca cgtggccaga 1800 ctgccttaaa aagttctcccgctcagacga gctgacccgc cactaccgga cccacactgg 1860 ggaaaagcag ttccgctgtccgctgtgtga gaagcgcttc atgaggagtg accacctcac 1920 aaagcacgcc cggcggcacaccgagttcca ccccagcatg atcaagcgat cgaaaaaggc 1980 gctggccaac gctttgtgaggtgctgcccg tggaagccag ggagggatgg accccgaaag 2040 gacaaaagta ctcccaggaaacagacgcgt gaaaactgag ccccagaaga ggcacacttg 2100 acggcacagg aagtcactgctctttggtca atattctgat tttcctctcc ctgcattgtt 2160 tttaaaaagc acattgtagcctaagatcaa agtcaacaac actcggtccc cttgaagagg 2220 caactctctg aacccgtctctgactgttgg agggaaggca aatgcttttg ggttttttgg 2280 tttttgtttt tgtttttttttctcctttta tttttttgcg ggggagggta gggagtgggt 2340 gggggggagg gggtaaggccaagactgggt agattttaaa gattcaacac tggtgtacat 2400 atgtccgctg ggtgagttgacctgtggcct cgcacagtga ttctaggccc tttatgcttg 2460 ctgtctctca gaattgttttcttacctttt aatgtaatga cgagtgtgct tcagtttgtt 2520 tagcaaaacc actctcttgaatcacgttaa cttttgagat taaaaaaaaa aacgccatag 2580 cacagctgtc tttatgcaagcaagagcaca tctactccag catgatctgt catctaaaga 2640 cttgaaaaca aaaaacagttacttatagtc aatgggtaag cagagtctga atttatacta 2700 atcaagacaa acctttgaaaggttacacta agtacagaac ttttaaacct tgctttgtat 2760 gagttgtact ttttgaacataagctgcact tttattttct aatgcagagg atgaataagt 2820 taaatacatg ctttgaggatagaagcagat gttctgtttg gcaccacgtt ataatctgct 2880 tattttacaa tatacacgtttccctaagaa atcatgcgca gagatgtgag ggcagaatat 2940 acacaacaga tgctgaaggagaaggagggt agtgttttgc aaaagaaaaa gaaaagaacc 3000 aacagaattt taactctattaacttttcca aattttccta tgcttttagt taacatcatt 3060 attgtatcct aatgccactaggggagagag cttttgactc tgttgggttt tatttgaatg 3120 tgtgcataac agtaatgagatctggaaaca cctatttttt ggggaaaaag gtttgttggt 3180 ctccttcctg tgttcctacaaaactcccac tctcaggtgc aagagttatg tagaaggaaa 3240 gggagctgaa ataggaacagaaaaatcaac ccctataact agtgaacacc aagggaaaat 3300 accacaatga tttcagaggagactctgcaa aatcgtccct tgtggagaat gcaggcaaca 3360 tggaatacta cgaatgaaatcacatcactg tatcttttac atcaatagcc tcaccactaa 3420 tatatcttgt atctaggtgtctataatggc tgaaaccact acatccatct atgccattta 3480 cctgaaaact taactgtggcctttatgagg ccagaaaagt gaactgagtt ttgtagttaa 3540 gacctcaaat gaggggagtcagcagtgatc atgggggaaa tgtttacatt ttttttttct 3600 tcagaagtaa cgctttctgatgattttatc tgatatttaa aacagggagc tatggtgcac 3660 tctagtttat acttgcgctctgaaatgtgt aaacataggg tgcctaccta tttcacctga 3720 cccatactcg tttctgattcagaatcagtg tgggctcctg cagtgggcgc gggtcacggc 3780 tgactccaac ttccaatacaacagccatca ctagcacagt gtttttttgt ttaaccaacg 3840 tagtgttatt agtagttctataaagagaac tgcttttaac attagggact gggagcagtc 3900 catgggataa aaaggaaagtgttttctcac gagaaaacat gtcaggaaaa ataaagaaca 3960 ctttctacct ctgtttcagatttttgaaac acttatttta aaccaaattt taatttctgt 4020 gtccaaaata agttttaaggacatctgttc ttccatacga aataggttag gctgcctatt 4080 tctcactgag ctcatggaatggttctgctt atgatactct gcacgctgcc ttttagtgag 4140 tgaggagttt ggggttgcctagcacttgct aacttgtaaa aagtcatctt tccctcacag 4200 aaagaaacga aagaaagcaaagcaaagtca gtgaaagaca atctttatag tttcaggagt 4260 aaatctaaat gtggcttttgtcaagcactt agatggatat aaatgcagca acttgtttta 4320 aaaaaatgca catttacttcccaaaaaagt tgttacttgc cttttcaagt gtgacaaact 4380 cacatttgat attctcttatatgttatagt aatgtaacgt ataaactcaa gcctttttat 4440 tctttgtgat taaatcctgttttaaaatgt cacaaaacag gaaccagcat tctaattaga 4500 tttactatat caagatatggttcaaatagg actactagag ttcattgaac actaaaacta 4560 tgaaacaatt actttttatattaaaaagac catggattta acttatgaaa atccaaatgc 4620 aggatagtaa tttttgtttacttttttaac caaactgaat ttttgaaaga ctattgcagg 4680 tgtttaaaaa gaaagaaaagttgttttatc taatactgta agtagttgtc atattctgga 4740 aaatttaata gttttagagttaagatatct cctctctttg gttagggaag aagaaagccc 4800 ttcaccattg tggaatgatgccctggcttt aaggtttagc tccacatcat gcttctctt 4859 35 1941 DNA Homo sapiens35 tctcttgatt cctagtctct cgatatggca cctccgtcag tctttgccga ggttccgcag 60gcccagcctg tcctggtctt caagctcact gccgacttca gggaggatcc ggacccccgc 120aaggtcaacc tgggagtggg agcatatcgc acggatgact gccatccctg ggttttgcca 180gtagtgaaga aagtggagca gaagattgct aatgacaata gcctaaatca cgagtatctg 240ccaatcctgg gcctggctga gttccggagc tgtgcttctc gtcttgccct tggggatgac 300agcccagcac tcaaggagaa gcgggtagga ggtgtgcaat ctttgggggg aacaggtgca 360cttcgaattg gagctgattt cttagcgcgt tggtacaatg gaacaaacaa caagaacaca 420cctgtctatg tgtcctcacc aacctgggag aatcacaatg ctgtgttttc cgctgctggt 480tttaaagaca ttcggtccta tcgctactgg gatgcagaga agagaggatt ggacctccag 540ggcttcctga atgatctgga gaatgctcct gagttctcca ttgttgtcct ccacgcctgt 600gcacacaacc caactgggat tgacccaact ccggagcagt ggaagcagat tgcttctgtc 660atgaagcacc ggtttctgtt ccccttcttt gactcagcct atcagggctt cgcatctgga 720aacctggaga gagatgcctg ggccattcgc tattttgtgt ctgaaggctt cgagttcttc 780tgtgcccagt ccttctccaa gaacttcggg ctctacaatg agagagtcgg gaatctgact 840gtggttggaa aagaacctga gagcatcctg caagtccttt cccagatgga gaagatcgtg 900cggattactt ggtccaatcc ccccgcccag ggagcacgaa ttgtggccag caccctctct 960aaccctgagc tctttgagga atggacaggt aatgtgaaga caatggctga ccggattctg 1020accatgagat ctgaactcag ggcacgacta gaagccctca aaacccctgg gacctggaac 1080cacatcactg atcaaattgg catgttcagc ttcactgggt tgaaccccaa gcaggttgag 1140tatctggtca atgaaaagca catctacctg ctgccaagtg gtcgaatcaa cgtgagtggc 1200ttaaccacca aaaatctaga ttacgtggcc acctccatcc atgaagcagt caccaaaatc 1260cagtgaagaa acaccacccg tccagtacca ccaaagtagt tctctgtcat gtgtgttccc 1320tgcctgcaca aacctacatg tacataccat ggattagaga cacttgcagg actgaaagct 1380gctctggtga ggcagcctct gtttaaaccg gccccacatg aagagaacat cccttgagac 1440gaatttggag actgggatta gagcctttgg aggtcaaagc aaattaagat ttttatttaa 1500gaataaaaga gtactttgat catgagacat aggtatcttg tccctctcac taaaaaggag 1560tgttgtgtgt ggcggccacg tgcttctatg tggtgtttga ctctgtacaa attctagtcc 1620caaagatcaa gttgtctgaa ggagccaaag tgtgaatgtg ggtgtcggct gcggcattaa 1680attcatcatc tcaacccaga gtgtctggtc tccctgctct ttctgcatgg ttgtgtccct 1740agtcctaagc tttggttctt tagggtgact gtggtaagaa ggatatttaa tcatgacatg 1800cacggacacg tacatattta actgaaacaa gttttaccaa acagtattta ctcgtgatgt 1860gcgtagtgca ttctgatatt tttgagccat tctattgtgt tctacttcac ctaaaaaaat 1920aaaataaaaa tgttgatcaa g 1941 36 2727 DNA Homo sapiens 36 agaagagcggagctgtgagc agtactgcgg cctcctctcc tctcctaacc tcgctctcgc 60 ggcctagctttacccgcccg cctgctcggc gaccagaaca ccttccacca tgaccacctc 120 agcaagttcccacttaaata aaggcatcaa gcaggtgtac atgtccctgc ctcagggtga 180 gaaagtccaggccatgtata tctggatcga tggtactgga gaaggactgc gctgcaagac 240 ccggaccctggacagtgagc ccaagtgtgt ggaagagttg cctgagtgga atttcgatgg 300 ctctagtactttacagtctg agggttccaa cagtgacatg tatctcgtgc ctgctgccat 360 gtttcgggaccccttccgta aggaccctaa caagctggtg ttatgtgaag ttttcaagta 420 caatcgaaggcctgcagaga ccaatttgag gcacacctgt aaacggataa tggacatggt 480 gagcaaccagcacccctggt ttggcatgga gcaggagtat accctcatgg ggacagatgg 540 gcacccctttggttggcctt ccaacggctt cccagggccc cagggtccat attactgtgg 600 tgtgggagcagacagagcct atggcaggga catcgtggag gcccattacc gggcctgctt 660 gtatgctggagtcaagattg cggggactaa tgccgaggtc atgcctgccc agtgggaatt 720 tcagattggaccttgtgaag gaatcagcat gggagatcat ctctgggtgg cccgtttcat 780 cttgcatcgtgtgtgtgaag actttggagt gatagcaacc tttgatccta agcccattcc 840 tgggaactggaatggtgcag gctgccatac caacttcagc accaaggcca tgcgggagga 900 gaatggtctgaagtacatcg aggaggccat tgagaaacta agcaagcggc accagtacca 960 catccgtgcctatgatccca agggaggcct ggacaatgcc cgacgtctaa ctggattcca 1020 tgaaacctccaacatcaacg acttttctgc tggtgtagcc aatcgtagcg ccagactacg 1080 cattccccggactgttggcc aggagaagaa gggttacttt gaagatcgtc gcccctctgc 1140 caactgcgagcccttttcgg tgacagaagc cctcatccgc acgtgtcttc tcaatgaaac 1200 cggcgatgagcccttccagt acaaaaatta agtggactag acctccagct gttgagcccc 1260 tcctagttcttcatccctga ctccaactct tccccctctc ccagttgtcc cgattgtaac 1320 tcaaagggtggaatatcaag gtcgtttttt tcattccatg tgcccagtta atcttgcttt 1380 cttttgtttggctgggatag aggggtcaag ttattaattt cttcacacct accctccttt 1440 ttttccctatcactgaagct ttttagtgca ttagtgggga ggagggtggg gagacataac 1500 cactgcttccatttaatggg gtgcacctgt ccaataggcg tacgtatccg gacagagcac 1560 gtttgcagaggggtctctct ccaggtagct gaaagggaag acctgacgta ctctggttag 1620 gttaggacttgccctcgtgg tggaaacttt tcttaaaaag ttataaccaa cttttctatt 1680 aaaagtgggaattaggagag aaggtagggg ttgggaatca gagagaatgg ctttggtctc 1740 ttgcttgtgggactagcctg gcttgggact aaatgccctg ctctgaacac aagcttagta 1800 taaactgatggatatcccta ccttgaaaga agaaaaggtt cttactgctt ggtccttgat 1860 ttatcacacaaagcagaata gtatttttat atttaaatgt aaagacaaaa aactatatgt 1920 atggttttgtggattatgtg tgttttggct aaaggaaaaa accatccagg tcacggggca 1980 ccaaatttgagacaaatagt cggattagaa ataaagcatc tcattttgag tagagagcaa 2040 ggaagtggttcttagatggt gatctgggat taggccctca agaccccttt tgggtttctg 2100 ccctgcccaccctctggaga aggtggcact gattagttaa cagaccaaca ccgttactag 2160 cagtcactgatctccgtggc tttggtttaa aagacacact tgtccacata ggtttagaga 2220 taagagttggctggtcaact tgagcatgtt actgacagag ggggtattgg ggttattttc 2280 tggtaggaatagcatgtcac taaagcaggc ctttgatatt aaatttttta aaaagcaaaa 2340 ttatagaagtttagatttta atcaaatttg tagggtttct aggtatttac agatgctgtt 2400 gctcaacgtctcctacctct gctctgagag atgggacagg ctgagtcaaa cactgtaatt 2460 ttgtatcttgatgtctttgt taagactgct gaagaattat tttttctttt ataataagga 2520 ataaaccccacctttattcc ttcatttcat ctaccatttt ctggttcttg tgttggctgt 2580 ggcaggccagctgtggtttt cttttgccat gacaacttct aattgccatg tacagtatgt 2640 tcaaagtcaaataactcctc attgtaaaca aactgtgtaa ctgcccaaag cagcacttat 2700 aaatcagcctaacataaaaa aaaaaaa 2727 37 831 DNA Homo sapiens 37 gttgacaaga gacattccagcccaccactt cccaagtaaa gaattaaaat gcagcatgat 60 ggctaaggca agggcctgcagaagaatgta aaggagggag gaagagcagg ggattcagag 120 caggaaggag gagacagtactgtctatccc gcagacgtgg tgctctttga agggatcctg 180 gccttctact cccaggaaaggtacgagacc tgttccagat gaagcttttt gtggatacag 240 atgcggacac ccggctctcacgcagagtat taagggacat cagcgagaga ggcagggatc 300 ttgagcagat tttatctcagtacattacgt tcgtcaagcc tgcctttgag gaattctgct 360 tgccaacaaa gcagtatgctgatgtgatca tccctagagg tgcagataat ctggtggcca 420 tcaacctcat cgagcagcacatccaggaca tcctgaatgg agggccctcc aaacggcaga 480 ccaatggctg tctcaacggctacacccctt cacgcaagag gcaggcatcg gagtccagca 540 gcaggccgca ttgacccgtctccatcggac cccagcccct atctccaaga gacagaggag 600 gcgtcaggag gcactgctcatctgtacata ctgtttccta tgacattact gtatttaaga 660 aaacaccatg gagatgaaatgcctttgatt ttttttttct ttttgtactt tggaacgaca 720 aaatgaaaca gaacttgaccctgagcttaa ataacaaaac tgtgccaact actactggtg 780 atgcctaatt atgaatccaacgtgtaacca gtaataaata catatatata t 831 38 3288 DNA Homo sapiens 38cttcctctcc acgcggttga gaagaccggt cggcctgggc aacctgcgct gaagatgccg 60ggaaaactcc gtagtgacgc tggtttggaa tcagacaccg caatgaaaaa aggggagaca 120ctgcgaaagc aaatcgagga gaaagagaaa aaagagaagc caaaatctga taagactgaa 180gagatagcag aagaggaaga aactgttttc cccaaagcta aacaagttaa aaagaaagca 240gagccttctg aagttgacat gaattctcct aaatccaaaa aggcaaaaaa gaaagaggag 300ccatctcaaa atgacatttc tcctaaaacc aaaagtttga gaaagaaaaa ggagcccatt 360gaaaagaaag tggtttcttc taaaaccaaa aaagtgacaa aaaatgagga gccttctgag 420gaagaaatag atgctcctaa gcccaagaag atgaagaaag aaaaggaaat gaatggagaa 480actagagaga aaagccccaa actgaagaat ggatttcctc atcctgaacc ggactgtaac 540cccagtgaag ctgccagtga agaaagtaac agtgagatag agcaggaaat acctgtggaa 600caaaaagaag gcgctttctc taattttccc atatctgaag aaactattaa acttctcaaa 660ggccgaggag tgaccttcct atttcctata caagcaaaga cattccatca tgtttacagc 720gggaaggact taattgcaca ggcacggaca ggaactggga agacattctc ctttgccatc 780cctttgattg agaaacttca tggggaactg caagacagga agagaggccg tgcccctcag 840gtactggttc ttgcacctac aagagagttg gcaaatcaag taagcaaaga cttcagtgac 900atcacaaaaa agctgtcagt ggcttgtttt tatggtggaa ctccctatgg aggtcaattt 960gaacgcatga ggaatgggat tgatatcctg gttggaacac caggtcgtat caaagaccac 1020atacagaatg gcaaactaga tctcaccaaa cttaagcatg ttgtcctgga tgaagtggac 1080cagatgttgg atatgggatt tgctgatcaa gtggaagaga ttttaagtgt ggcatacaag 1140aaagattctg aagacaatcc ccaaacattg cttttttctg caacttgccc tcattgggta 1200tttaatgttg ccaagaaata catgaaatct acatatgaac aggtggacct gattggtaaa 1260aagactcaga aaacggcaat aactgtggag catctggcta ttaagtgcca ctggactcag 1320agggcagcag ttattgggga tgtcatccga gtatatagtg gtcatcaagg acgcactatc 1380atcttttgtg aaaccaagaa agaagcccag gagctgtccc agaattcagc tataaagcag 1440gatgctcagt ccttgcatgg agacattcca cagaagcaaa gggaaatcac cctgaaaggt 1500tttagaaatg gtagttttgg agttttggtg gcaaccaatg ttgctgcacg tgggttagac 1560atccctgagg ttgatttggt tatacaaagc tctccaccaa aggatgtaga gtcctacatt 1620catcgatccg ggcggacagg cagagctgga aggacggggg tgtgcatctg cttttatcag 1680cacaaggaag aatatcagtt agtacaagtg gagcaaaaag cgggaattaa gttcaaacga 1740ataggtgttc cttctgcaac agaaataata aaagcttcca gcaaagatgc catcaggctt 1800ttggattccg tgcctcccac tgccattagt cacttcaaac aatcagctga gaagctgata 1860gaggagaagg gagctgtgga agctctggca gcagcactgg cccatatttc aggtgccacg 1920tccgtagacc agcgctcctt gatcaactca aatgtgggtt ttgtgaccat gatcttgcag 1980tgctcaattg aaatgccaaa tattagttat gcttggaaag aacttaaaga gcagctgggc 2040gaggagattg attccaaagt gaagggaatg gtttttctca aaggaaagct gggtgtttgc 2100tttgatgtac ctaccgcatc agtaacagaa atacaggaga aatggcatga ttcacgacgc 2160tggcagctct ctgtggccac agagcaacca gaactggaag gaccacggga aggatatgga 2220ggcttcaggg gacagcggga aggcagtcga ggcttcaggg gacagcggga cggaaacaga 2280agattcagag gacagcggga aggcagtaga ggcccgagag gacagcgatc aggaggtggc 2340aacaaaagta acagatccca aaacaaaggc cagaagcgga gtttcagtaa agcatttggt 2400caataattag aaatagaaga tttatatagc aaaaagagaa tgatgtttgg caatatagaa 2460ctgaacatta tttttcatgc aaagttaaaa gcacattgtg cctccttttg accacttgcc 2520aagtccctgt ctctttcaga cacagacaag cttcatttaa attatttcat ctgatcatta 2580tcatttataa ctttattgtt acttcttcat cagtttttcc ttttgaaagg tgtatgaatt 2640cattacattt ttattctaat gtattatctg tagattagaa gataaaatca agcatgtatc 2700tgcctatact ttgtgagttc acctgtcttt atactcaaaa gtgtccctta atagtgtcct 2760tccctgaaat aaatacctaa gggagtgtaa cagtctctgg aggaccactt tgagcctttg 2820gaagttaagg tttcctcagc cacctgccga acagtttctc atgtggtcct attatttgtc 2880tactgagact taatactgag caatgttttg aaacaagatt tcaaactaat ctgggttgta 2940atacagttta taccagtgta tgctctagac ttggaagatg tagtatgttt gatgtggatt 3000acctatactt atgttcgttt tgatacattt ttagcttctc attataaggt gattcatgct 3060ttagtgaatt cttatagatg atatataaaa gtacatttta atagaagcca gggtttaagg 3120aatttcacat gtataaggtg gctccatagc tttatttgta agtaggctgg ataaatggtg 3180cttaaatggt aatgtactcc acttcttccc attggaagat taacattatt taccaagaag 3240gacttaaggg agtagggggc gcagattagc attgctcaag agtatgga 3288 39 3442 DNAHomo sapiens 39 agccggtgcg ccgcagacta gggcgcctcg ggccagggag cgcggaggagccatggccac 60 cgctaacggg gccgtggaaa acgggcagcc ggacgggaag ccgccggccctgccgcgccc 120 catccgcaac ctggaggtca agttcaccaa gatatttatc aacaatgaatggcacgaatc 180 caagagtggg aaaaagtttg ctacatgtaa cccttcaact cgggagcaaatatgtgaagt 240 ggaagaagga gataagcccg acgtggacaa ggctgtggag gctgcacaggttgccttcca 300 gaggggctcg ccatggcgcc ggctggatgc cctgagtcgt gggcggctgctgcaccagct 360 ggctgacctg gtggagaggg accgcgccac cttggccgcc ctggagacgatggatacagg 420 gaagccattt cttcatgctt ttttcatcga cctggagggc tgtattagaaccctcagata 480 ctttgcaggg tgggcagaca aaatccaggg caagaccatc cccacagatgacaacgtcgt 540 atgcttcacc aggcatgagc ccattggtgt ctgtggggcc atcactccatggaacttccc 600 cctgctgatg ctggtgtgga agctggcacc cgccctctgc tgtgggaacaccatggtcct 660 gaagcctgcg gagcagacac ctctcaccgc cctttatctc ggctctctgatcaaagaggc 720 cgggttccct ccaggagtgg tgaacattgt gccaggattc gggcccacagtgggagcagc 780 aatttcttct caccctcaga tcaacaagat cgccttcacc ggctccacagaggttggaaa 840 actggttaaa gaagctgcgt cccggagcaa tctgaagcgg gtgacgctggagctgggggg 900 gaagaacccc tgcatcgtgt gtgcggacgc tgacttggac ttggcagtggagtgtgccca 960 tcagggagtg ttcttcaacc aaggccagtg ttgcacggca gcctccagggtgttcgtgga 1020 ggagcaggtc tactctgagt ttgtcaggcg gagcgtggag tatgccaagaaacggcccgt 1080 gggagacccc ttcgatgtca aaacagaaca ggggcctcag attgatcaaaagcagttcga 1140 caaaatctta gagctgatcg agagtgggaa gaaggaaggg gccaagctggaatgcggggg 1200 ctcagccatg gaagacaagg ggctcttcat caaacccact gtcttctcagaagtcacaga 1260 caacatgcgg attgccaaag aggagatttt cgggccagtg caaccaatactgaagttcaa 1320 aagtatcgaa gaagtgataa aaagagcgaa tagcaccgac tatggactcacagcagccgt 1380 gttcacaaaa aatctcgaca aagccctgaa gttggcttct gccttagagtctggaacggt 1440 ctggatcaac tgctacaacg ccctctatgc acaggctcca tttggtggctttaaaatgtc 1500 aggaaatggc agagaactag gtgaatacgc tttggccgaa tacacagaagtgaaaactgt 1560 caccatcaaa cttggcgaca agaacccctg aaggaaaggc ggggctccttcctcaaacat 1620 cggacggcgg aatgtggcag atgaaatgtg ctggaggaaa aaaatgacatttctgacctt 1680 cccgggacac attcttctgg aggctttaca tctactggag ttgaatgattgctgttttcc 1740 tctcactctc ctgtttattc accagactgg ggatgcctat aggttgtctgtgaaatcgca 1800 gtcctgcctg gggagggagc tgttggccat ttctgtgttt ccctttaaaccagatcctgg 1860 agacagtgag atactcaggg cgttgttaac agggagtggt atttgaagtgtccagcagtt 1920 gcttgaaatg ctttgccgaa tctgactcca gtaagaatgt gggaaaaccccctgtgtgtt 1980 ctgcaagcag ggctcttgca ccagcggtct cctcagggtg gacctgcttacagagcaagc 2040 cacgcctctt tccgaggtga aggtgggacc attccttggg aaaggattcacagtaaggtt 2100 ttttggtttt tgttttttgt tttcttgttt ttaaaaaaag gatttcacagtgagaaagtt 2160 ttggttagtg cataccgtgg aagggcgcca gggtctttgt ggattgcatgttgacattga 2220 ccgtgagatt cggcttcaaa ccaatactgc ctttggaata tgacagaatcaatagcccag 2280 agagcttagt caaagacgat atcacggtct accttaacca aggcactttcttaagcagaa 2340 aatattgttg aggttacctt tgctgctaaa gatccaatct tctaacgccacaacagcata 2400 gcaaatccta ggataattca cctcctcatt tgacaaatca gagctgtaattcactttaac 2460 aaattacgca tttctatcac gttcactaac agcttatgat aagtctgtgtagtcttcctt 2520 ttctccagtt ctgttaccca atttagatta gtaaagcgta cacaactggaaagactgctg 2580 taataacaca gccttgttat ttttaagtcc tattttgata ttaatttctgattagttagt 2640 aaataacacc tggattctat ggaggacctc ggtcttcatc caagtggcctgagtatttca 2700 ctggcaggtt gtgaattttt cttttcctct ttgggaatcc aaatgatgatgtgcaatttc 2760 atgttttaac ttgggaaact gaaagtgttc ccatatagct tcaaaaacaaaaacaaatgt 2820 gttatccgac ggatactttt atggttacta actagtactt tcctaattgggaaagtagtg 2880 cttaagtttg caaattaagt tggggagggc aataataaaa tgagggcccgtaacagaacc 2940 agtgtgtgta taacgaaaac catgtataaa atgggcctat cacccttgtcagagatataa 3000 attaccacat ttggcttccc ttcatcagct aacacttatc acttatactaccaataactt 3060 gttaaatcag gatttggctt catacactga attttcagta ttttatctcaagtagatata 3120 gacactaacc ttgatagtga tacgttagag ggttcctatt cttccattgtacgataatgt 3180 ctttaatatg aaatgctaca ttatttataa ttggtagagt tattgtatctttttatagtt 3240 gtaagtacac agaggtggta tatttaaact tctgtaatat actgtatttagaaatggaaa 3300 tatatatagt gttaggtttc acttctttta aggtttaccc ctgtggtgtggtttaaaaat 3360 ctataggcct gggaattccg atcctagctg cagatcgcat cccacaatgcgagaatgata 3420 aaataaaatt ggatatttga ga 3442 40 1540 DNA Homo sapiens40 gccctcggcc ccgggccggc ccgccccgcc tcggccgccg cctggcgagc cgccgggtcc 60ccgctcggcc ggtggccgag gccggagggc cgcggcgggc ggcggccgag gcggctccgg 120ccagggccgg gccgggggcc ggggggcggc ggcgggcagg cggccgcgtc ggccggggcc 180gggacgatga ctctggagtc catgatggcg tgttgcctga gcgatgaggt gaaggagtcc 240aagcggatca acgccgagat cgagaagcag ctgcggcggg acaagcgcga cgcccggcgc 300gagctcaagc tgctgctgct cggcacgggc gagagcggga agagcacgtt catcaagcag 360atgcgcatca tccacggcgc cggctactcg gaggaggaca agcgcggctt caccaagctc 420gtctaccaga acatcttcac cgccatgcag gccatgatcc gggccatgga gacgctcaag 480atcctctaca agtacgagca gaacaaggcc aatgcgctcc tgatccggga ggtggacgtg 540gagaaggtga ccaccttcga gcatcagtac gtcagtgcca tcaagaccct gtgggaggac 600ccgggcatcc aggaatgcta cgaccgcagg cgcgagtacc agctctccga ctctgccaag 660tactacctga ccgacgttga ccgcatcgcc accttgggct acctgcccac ccagcaggac 720gtgctgcggg tccgcgtgcc caccaccggc atcatcgagt accctttcga cctggagaac 780atcatcttcc ggatggtgga tgtggggggc cagcggtcgg agcggaggaa gtggatccac 840tgctttgaga acgtgacatc catcatgttt ctcgtcgccc tcagcgaata cgaccaagtc 900ctggtggagt cggacaacga gaaccggatg gaggagagca aagccctgtt ccggaccatc 960atcacctacc cctggttcca gaactcctcc gtcatcctct tcctcaacaa gaaggacctg 1020ctggaggaca agatcctgta ctcgcacctg gtggactact tccccgagtt cgatggtccc 1080cagcgggagc cccaggcggc gcgggagttc atcctgaaga tgttcgtgga cctgaacccc 1140gacagcgaca agatcatcta ctcacacttc acgtgtgcca ccgacacgga gaacatccgc 1200ttcgtgttcg cggccgtgaa ggacaccatc ctgcagctca acctcaagga gtacaacctg 1260gtctgagcgc cccaggccca gggagacggg atggagacac ggggcaggac cttccttcca 1320cggagcctgc gctgccgggc gggtggcgct gccgagtccg ggccggggct ctgccgcggg 1380aggagatttt ttttttttca tatttttaac aaatggtttt tatttcacag ttatcagggg 1440atgtacatct ctccctccgt acacttcgcg caccttctca ccttttgtca acggcaaagg 1500cagccttttt ctggccttga cttatggctc gcttttttct 1540 41 1517 DNA Homosapiens 41 attctttggg gaggcaacta ggatggtgtg gccgaccacg gatttgcattgccgaggacg 60 ggaccccagg gcagcgaagc agaatggcca acatgcaggg actggtggaaagactggaac 120 gagctgtcag ccgcctggag tcgctgtctg cagagtccca caggccccctgggaactgcg 180 gggaagtcaa tggtgtcatt gcaggtgtgg caccctccgt ggaagcctttgacaagctga 240 tggacagtat ggtggccgag tttttaaaga acagtaggat ccttgctggggacgtggaga 300 cccatgcaga aatggtgcac agtgctttcc aggcccagcg ggctttccttctgatggcct 360 ctcagtacca acaaccccac gagaatgacg tggccgcact tctgaaacccatatcggaaa 420 agattcagga aatccaaact ttcagagaga gaaaccgggg gagtaacatgtttaatcatc 480 tttcggccgt cagcgaaagc atccctgccc ttggatggat agctgtgtctcccaaacctg 540 gtccttatgt caaggagatg aatgacgctg ccacctttta cactaacagggtcttaaagg 600 actacaaaca cagtgatttg cgtcatgtgg attgggtgaa gtcatatttgaacatttgga 660 gtgaacttca agcatacatc aaggaacacc acaccacggg cctcacatggagcaaaacag 720 gtcctgtagc atccacagta tcagcgtttt ctgtcctctc ctctgggcctggccttcctc 780 caccccctcc tcctctgcct cctccagggc cacctccact tttcgagaatgaaggcaaaa 840 aagaggaatc ttctccttca cgctcagctt tatttgccca acttaaccagggagaagcaa 900 ttacaaaagg gctccgccat gtcacagatg accagaagac atacaaaaatcccagcctgc 960 gggctcaagg agggcaaact caatctccca ccaaaagtca cactccaagtcccacatctc 1020 ctaaatctta tccttctcaa aaacatgccc cagtgttgga gttggaaggaaagaaatgga 1080 gagtggagta ccaagaggac aggaatgacc ttgtgatttc agagactgagctgaaacaag 1140 tggcttacat tttcaaatgc gaaaaatcaa ctattcagat aaaagggaaagtaaactcca 1200 ttataattga caactgtaag aaactcggcc tggtgtttga caatgtggtgggcattgtgg 1260 aagtgatcaa ctcccaggac attcaaatcc aggtaatggg gagagtgccaacaatttcca 1320 ttaataagac agaaggttgc cacatatacc tcagtgaaga tgcattagactgtgagatcg 1380 tgagcgccaa gtcatctgaa atgaacatac ttatccctca ggatggtgattatagagaat 1440 ttcccattcc tgaacagttc aagacagcat gggatggatc caagttaatcactgaacctg 1500 cagaaattat ggcctaa 1517 42 1616 DNA Homo sapiens 42tgctgaacca tttttcttag gatgcagccg tctcactccc ttgtcctgta aatcgtgtat 60tcatgttgat gattcttgga gataggtttc actttttccc agctgcgtcc acaggaaagg 120ggagtcggat gccagctgca ccccgcctgg ctcgcacagg ctaagaccac agacagagca 180gggcttcccg gagccacaca ggccacgcac cccaggaacc cttgctgccg cgggccagga 240acaggaatgt gttggtgcct gagacaccaa atggaagaag cacatcaaga ctgttctcct 300gcggccaaca ctggcccgga agccgccctc catacaggcc ctcagggggc ctgccttctg 360cgcctcagtc ccccgtgcat ccctgggcct gggtatcaca tgctctccag gaaagggacg 420gaatcaatcg tgtgaccgat gggctcgcaa ggatgggtgc cgccgtggga gccctgcctc 480tggtgctggc aagggattgg gtttgtgtgg gtgtctctag cctgcagagt gcagtgagtg 540agagtccttg ggagcgcggc gctgcctgta gctgtgcctg gggatgcacg tggccacggg 600atttcagtgg gacagcgctc ccacaggggc tgggggtggg ggtggggttt cttagttact 660gttggaaagg gaaaaattca ccatatccaa ggggagagac gatgggctgg gtttgtttac 720tccaacttcc cttctacacc cctcctgcag gacagtacga tttggggaga acccagctcc 780ccactttatc tgcagactct gggacctgac aaaacagtca gagcctgagt gcactgcagc 840ctgaactccc ttgagcagcg ctataaggga ctttgcactt taaaaagggg atgcctgtca 900gtaaatcccc tgtgcattga ctagaactgg ggggctgcgc ccgctccctc cttaatccta 960gatgatttgc tcatgaaata gaggtggggg acgaccgcat gcactctggg aggtgcagcc 1020ctaaggggtg gactccagat ctccctgcaa gagacagctt ggcttggctt tggctgttgg 1080ggaggagtcc ctgccatccc ggtgagcctg gggctgttgc ttagggtctt ctgggtggac 1140acgtggagaa agagaaggca aacgttggaa cactaggaaa agctagaaat tcagacaaca 1200cacatggatc cccttaaaac atgtaaatgt gtcagaacac ggttgacctg ccgccttctt 1260gaacctggtg gcccccgttg gaactatcag tggcgtctcc catgcacacg ccctctgctt 1320tctctttcct agactcgcgg tgctcacatc cagacattac cttgttggta gcccccaagt 1380ggcgtgcagt gacaccagta tcttctctgt tgcatttttg caatcttgtg tcccgctcgg 1440tgatgttcta caactctgtt ttaaggttga gaaagtttca agggtgaaga tctcaaaaca 1500gtgctaaaat caaaggtgtt tgctgtgaag aaaaacatgt gtatatattg caccttgagt 1560tgtcagaagg tagaaactga aataaactaa ctttaaaaaa aaaaaaaaaa aaaaaa 1616 432408 DNA Homo sapiens 43 ccgcgcctcc tcggccgcct gtcgggcatg aaaaccaaattctgcaccgg gggcgaggcg 60 gagccctcgc cgctcgggct gctgctgagc tgcggtagcggcagcgcggc cccggcgccc 120 ggcgtggggc agcagcgcga cgccgccagc gacctcgagtccaagcagct ggcgccaaca 180 gccgcgctcg cgctgccccc tccgccgccg ctgccgctgccgctgccgct gccccagccc 240 ccgccgccgc agccgcccgc agacgagcag ccggagccccgggcgcggcg cagggcctat 300 ctgtggtgca aggagttcct gcccggcgcc tggcggggcctccgcgagga cgagttccac 360 atcagtgtca tcagaggcgg ccttagcaac atgctgttccagtgctccct acctgacacc 420 acagccaccc ttggtgatga gcctcggaaa gtgctcctgcggctgtatgg agcgattttg 480 cagatgaggt cctgtaataa agagggatcc gaacaagctcagaaagaaaa tgaatttcaa 540 ggggctgagg ccatggttct ggagagcgtt atgtttgccattctcgcaga gaggtcactt 600 gggccaaaac tctatggcat ctttccccaa ggccgactggagcagttcat cccgagccgg 660 cgattagata ctgaagaatt aagtttgcca gatatttctgcagaaatcgc cgagaaaatg 720 gctacatttc atggtatgaa aatgccattc aataaggaaccaaaatggct ttttggcaca 780 atggaaaagt atctaaagga agtgctgaga attaaatttactgaggaatc cagaattaaa 840 aagctccaca aattgctcag ttacaatctg cccttggaactggaaaacct gagatcattg 900 cttgaatcta ctccatctcc agttgtattt tgtcataatgactgtcaaga aggtaatatc 960 ttgttgctgg aaggccgaga gaattctgaa aaacagaaactgatgctcat tgatttcgaa 1020 tacagcagtt acaattacag gggattcgac attggaaatcacttctgtga gtggatgtat 1080 gattatagct atgaaaaata cccttttttc agagcaaacatccggaagta tcccaccaag 1140 aaacaacagc tccattttat ttccagttac ttgcctgcattccaaaatga ctttgaaaac 1200 ctcagtactg aagaaaaatc cattataaaa gaagaaatgttgcttgaagt taataggttt 1260 gcccttgcat ctcatttcct ctggggactg tggtccattgtacaagccaa gatttcatct 1320 attgaatttg ggtacatgga ctacgcccaa gcaaggtttgatgcctattt ccaccagaag 1380 aggaagcttg gggtgtgact gtggggagga ctccatccacctcatcactg gactgcatgg 1440 ggaggcagca gagcgcggtc ccctctgtgc ttcgactactgctcctgtgg caggaggctt 1500 tgggtggctc actactgaac acatgtgtat gatactaaagacggtattaa aatggagcga 1560 cgtttatttc atctcttgtt tacgatttca ctaggactcagaaacgagat cgggaagacg 1620 aaatatagtg caatagtgca acatctctga atccttttaatctagagaag gcatttcata 1680 tttgggggct aaggtttcca gtcagatgag gcaaacagcaagagtaagca gtgttacttg 1740 caggtacttt ggttaatgtt gactttaaat tttcatgaatgtgctggtga acactgtgac 1800 caggcttttg tagatggcga ctgtgttata gacggtgctcactcccaagg gacagcaagt 1860 gagcagagat gtactgcaaa gtcgccagtc actgcgtgcaaggtggcctc tgcctggggc 1920 cgtccagaag ctgctccttt accctcttgg tcccatggctgaagcggagc agctggattg 1980 ctctggagca gccaaggccg ccactgtgga gacagagctctcccctcctg ctgggcgtgt 2040 gtgacactgt agagtttcac tgtactcgat gtgacttctcccctgccctt cctcctgatg 2100 gagtgtgcag acagccatgc gtggccacgg gggcagtgtgaggacctccc tgtctcccgc 2160 tcccctccca gggagcagct gcttgaccta gctctttgggcctctcctgc cctctgctct 2220 gcctggagtg tcggatcctg tgagtaggct gggcctcccctgggcagggt tctccaaggc 2280 cggtttcccg gcccttacca aacctgatgc ccctgacatcatcattcttg tgggagacag 2340 cagcctgtat gtggtgtggg gcgtggatcg agtgtagctgtgaaatccat atatatgaaa 2400 tgtccaat 2408 44 1610 DNA Homo sapiensmisc_feature (1)..(1610) n = a, c, g or t 44 cgtaacagga caaggagtcctgctccggca cgtggccaca gaaaactact taggaagcct 60 gtggtgagaa caacaacagtgcctggagaa tcccacggct ctggggaagt gagccccgag 120 gatgaggctg ctcgcctggctgattttcct ggctaactgg ggaggtgcca gggctgaacc 180 agggaagttc tggcacatcgctgacctgca ccttgaccct gactacaagg tatccaaaga 240 ccccttccag gtgtgcccatcagctggatc ccagccagtg cccgacgcag gcccctgggg 300 tgactacctc tgtgattctccctgggccct catcaactcc tccatctatg ccatgaagga 360 gattgagcca gagccagacttcattctctg gactggtgat gacacgcctc atgtgcccga 420 tgagaaactg ggagaggcagctgtactgga aattgtggaa cgcctgacca agctcatcag 480 agaggtcttt ccagatactaaagtctatgc tgctttggga aatcatgatt ttcaccccaa 540 aaaccagttc ccagctggaagtaacaacat ctacaatcag atagcagaac tatggaaacc 600 ctggcttagt aatgagtccatcgctctctt caaaaaaggt gccttctact gtgagaagct 660 gccgggtccc agcggggctgggcgaattgt ggtcctcaac accaatctgt actataccag 720 caatgcgctg acagcagacatggcggaccc tggccagcag ttccagtggc tggaagatgt 780 gctgaccgat gcatccaaagctggggacat ggtgtacatt gtcggccacg tgcccccggg 840 gttctttgag aagacgcaaaacaaggcatg gttccgggag ggcttcaatg aaaaatacct 900 gaaggtggtc cggaagcatcatcgcgtcat agcagggcag ttcttcgggc accaccacac 960 cgacagcttt cggatgctctatgatgatgc aggtgtcccc ataagcgcca tgttcatcac 1020 acctggagtc accccatggaaaaccacatt acctggagtg gtcaatgggg ccaacaatcc 1080 agccatccgg gtgttcgaatatgaccgagc cacactgagc ctnnaggaca tggtgaccta 1140 cttcatgaac ctgagccaggcgaatgctca ggggacgccg cgctgggagc tcgagtacca 1200 gctgaccgag gcctatggggtgccggacgc cagcgcccac tccatcgaca cagtgctgga 1260 ccgcatcgct ggcgaccagagcacactgca gcgctactac gtctataact cagtcagcta 1320 ctctgctggg gtctgcgacgaggcctgcag catgcagcac gtgtgtgcca tgcgccaggt 1380 ggacattgac gcttacaccacctgtctgta tgcctctggc accacgcccg tgccccagct 1440 nccgntgctg ctgatggccctgctggggct gtgcacgact cgtgctgtga cctgccaggc 1500 tcaccattct tcctggtaacgggtaacggg ggcagcgccc aggatcaccc agagctgggc 1560 cttccaccat ttcctccgcgcctgaggagt gaactgaatg gacaccgatc 1610 45 1882 DNA Homo sapiens 45gggcaggaag acggcgctgc ccggaggagc ggggcgggcg ggcgcgcggg ggagcgggcg 60gcgggcggga gccaggcccg ggcgggggcg ggggcggcgg ggccagaaga ggcggcgggc 120cgcgctccgg ccggtctgcg gcgttggcct tggctttggc tttggcggcg gcggtggaga 180agatgctgca gtccctggcc ggcagctcgt gcgtgcgcct ggtggagcgg caccgctcgg 240cctggtgctt cggcttcctg gtgctgggct acttgctcta cctggtcttc ggcgcagtgg 300tcttctcctc ggtggagctg ccctatgagg acctgctgcg ccaggagctg cgcaagctga 360agcgacgctt cttggaggag cacgagtgcc tgtctgagca gcagctggag cagttcctgg 420gccgggtgct ggaggccagc aactacggcg tgtcggtgct cagcaacgcc tcgggcaact 480ggaactggga cttcacctcc gcgctcttct tcgccagcac cgtgctctcc accacaggtt 540atggccacac cgtgcccttg tcagatggag gtaaggcctt ctgcatcatc tactccgtca 600ttggcattcc cttcaccctc ctgttcctga cggctgtggt ccagcgcatc accgtgcacg 660tcacccgcag gccggtcctc tacttccaca tccgctgggg cttctccaag caggtggtgg 720ccatcgtcca tgccgtgctc cttgggtttg tcactgtgtc ctgcttcttc ttcatcccgg 780ccgctgtctt ctcagtcctg gaggatgact ggaacttcct ggaatccttt tatttttgtt 840ttatttccct gagcaccatt ggcctggggg attatgtgcc tggggaaggc tacaatcaaa 900aattcagaga gctctataag attgggatca cgtgttacct gctacttggc cttattgcca 960tgttggtagt tctggaaacc ttctgtgaac tccatgagct gaaaaaattc agaaaaatgt 1020tctatgtgaa gaaggacaag gacgaggatc aggtgcacat catagagcat gaccaactgt 1080ccttctcctc gatcacagac caggcagctg gcatgaaaga ggaccagaag caaaatgagc 1140cttttgtggc cacccagtca tctgcctgcg tggatggccc tgcaaaccat tgagcgtagg 1200atttgttgca ttatgctaga gcaccagggt cagggtgcaa ggaagaggct taagtatgtt 1260catttttatc agaatgcaaa agcgaaaatt atgtcacttt aagaaatagc tactgtttgc 1320aatgtcttat taaaaaacaa caaaaaaaga cacatggaac aaagaagctg tgaccccagc 1380aggatgtcta atatgtgagg aaatgagatg tccacctaaa attcatatgt gacaaaatta 1440tctcgacctt acataggagg agaatacttg aagcagtatg ctgctgtggt tagaagcaga 1500ttttatactt ttaactggaa actttggggt ttgcatttag atcatttagc tgatggctaa 1560atagcaaaat ttatatttag aagcaaaaaa aaaaagcata gagatgtgtt ttataaatag 1620gtttatgtgt actggtttgc atgtacccac ccaaaatgat tatttttgga gaatctaagt 1680caaactcact atttataatg cataggtaac cattaactat gtacatataa agtataaata 1740tgtttatatt ctgtacatat ggtttaggtc accagatcct agtgtagttc tgaaactaag 1800actatagata ttttgtttct tttgatttct ctttatacta aagaatccag agttgctaca 1860ataaaataag gggaataata aa 1882 46 1805 DNA Homo sapiens 46 aagagactgaactgtatctg cctctatttc caaaagactc acgttcaact ttcgctcaca 60 caaagccgggaaaattttat tagtcctttt tttaaaaaaa gttaatataa aattatagca 120 aaaaaaaaaaggaacctgaa ctttagtaac acagctggaa caatcgcagc ggcggcggca 180 gcggcgggagaagaggttta atttagttga ttttctgtgg ttgttggttg ttcgctagtc 240 tcacggtgatggaagctgca cattttttcg aagggaccga gaagctgctg gaggtttggt 300 tctcccggcagcagcccgac gcaaaccaag gatctgggga tcttcgcact atcccaagat 360 ctgagtgggacatacttttg aaggatgtgc aatgttcaat cataagtgtg acaaaaactg 420 acaagcaggaagcttatgta ctcagtgaga gtagcatgtt tgtctccaag agacgtttca 480 ttttgaagacatgtggtacc accctcttgc tgaaagcact ggttcccctg ttgaagcttg 540 ctagggattacagtgggttt gactcaattc aaagcttctt ttattctcgt aagaatttca 600 tgaagccttctcaccaaggg tacccacacc ggaatttcca ggaagaaata gagtttctta 660 atgcaattttcccaaatgga gcaggatatt gtatgggacg tatgaattct gactgttggt 720 acttatatactctggatttc ccagagagtc gggtaatcag tcagccagat caaaccttgg 780 aaattctgatgagtgagctt gacccagcag ttatggacca gttctacatg aaagatggtg 840 ttactgcaaaggatgtcact cgtgagagtg gaattcgtga cctgatacca ggttctgtca 900 ttgatgccacaatgttcaat ccttgtgggt attcgatgaa tggaatgaaa tcggatggaa 960 cttattggactattcacatc actccagaac cagaattttc ttatgttagc tttgaaacaa 1020 acttaagtcagacctcctat gatgacctga tcaggaaagt tgtagaagtc ttcaagccag 1080 gaaaatttgtgaccaccttg tttgttaatc agagttctaa atgtcgcaca gtgcttgctt 1140 cgccccagaagattgaaggt tttaagcgtc ttgattgcca gagtgctatg ttcaatgatt 1200 acaattttgtttttaccagt tttgctaaga agcagcaaca acagcagagt tgattaagaa 1260 aaatgaagaaaaaacgcaaa aagagaacac atgtagaagg tggtggatgc tttctagatg 1320 tcgatgctgggggcagtgct ttccataacc accactgtgt agttgcagaa agccctagat 1380 gtaatgatagtgtaatcatt ttgaattgta tgcattatta tatcaaggag ttagatatct 1440 tgcatgaatgctctcttctg tgtttaggta ttctctgcca ctcttgctgt gaaattgaag 1500 tggatgtagaaaaaaccttt tactatatga aactttacaa cacttgtgaa agcaactcaa 1560 tttggtttatgcacagtgta atatttctcc aagtatcatc caaaattccc cacagacaag 1620 gctttcgtcctcattaggtg ttggcctcag cctaaccctc taggactgtt ctattaaatt 1680 gctgccagaattttacatcc agttacctcc actttctaga acatattctt tactaatgtt 1740 attgaaaccaatttctactt catactgatg tttttggaaa cagcaattaa agtttttctt 1800 ccatg 180547 2653 DNA Homo sapiens 47 gagcgcggct ggagtttgct gctgccgctg tgcagtttgttcaggggctt gtggcggtga 60 gtccgagagg ctgcgtgtga gagacgtgag aaggatcctgcactgaggag gtggaaagaa 120 gaggattgct cgaggaggcc tggggtctgt gagacagcggagctgggtga aggctgcggg 180 ttccggcgag gcctgagctg tgctgtcgtc atgcctcaaacccgatccca ggcacaggct 240 acaatcagtt ttccaaaaag gaagctgtct cgggcattgaacaaagctaa aaactccagt 300 gatgccaaac tagaaccaac aaatgtccaa accgtaacctgttctcctcg tgtaaaagcc 360 ctgcctctca gccccaggaa acgtctgggc gatgacaacctatgcaacac tccccattta 420 cctccttgtt ctccaccaaa gcaaggcaag aaagagaatggtccccctca ctcacataca 480 cttaagggac gaagattggt atttgacaat cagctgacaattaagtctcc tagcaaaaga 540 gaactagcca aagttcacca aaacaaaata ctttcttcagttagaaaaag tcaagagatc 600 acaacaaatt ctgagcagag atgtccactg aagaaagaatctgcatgtgt gagactattc 660 aagcaagaag gcacttgcta ccagcaagca aagctggtcctgaacacagc tgtcccagat 720 cggctgcctg ccagggaaag ggagatggat gtcatcaggaatttcttgag ggaacacatc 780 tgtgggaaaa aagctggaag cctttacctt tctggtgctcctggaactgg aaaaactgcc 840 tgcttaagcc ggattctgca agacctcaag aaggaactgaaaggctttaa aactatcatg 900 ctgaattgca tgtccttgag gactgcccag gctgtattcccagctattgc tcaggagatt 960 tgtcaggaag aggtatccag gccagctggg aaggacatgatgaggaaatt ggaaaaacat 1020 atgactgcag agaagggccc catgattgtg ttggtattggacgagatgga tcaactggac 1080 agcaaaggcc aggatgtatt gtacacgcta tttgaatggccatggctaag caattctcac 1140 ttggtgctga ttggtattgc taataccctg gatctcacagatagaattct acctaggctt 1200 caagctagag aaaaatgtaa gccacagctg ttgaacttcccaccttatac cagaaatcag 1260 atagtcacta ttttgcaaga tcgacttaat caggtatctagagatcaggt tctggacaat 1320 gctgcagttc aattctgtgc ccgcaaagtc tctgctgtttcaggagatgt tcgcaaagca 1380 ctggatgttt gcaggagagc tattgaaatt gtagagtcagatgtcaaaag ccagactatt 1440 ctcaaaccac tgtctgaatg taaatcacct tctgagcctctgattcccaa gagggttggt 1500 cttattcaca tatcccaagt catctcagaa gttgatggtaacaggatgac cttgagccaa 1560 gagggagcac aagattcctt ccctcttcag cagaagatcttggtttgctc tttgatgctc 1620 ttgatcaggc agttgaaaat caaagaggtc actctggggaagttatatga agcctacagt 1680 aaagtctgtc gcaaacagca ggtggcggct gtggaccagtcagagtgttt gtcactttca 1740 gggctcttgg aagccagggg cattttagga ttaaagagaaacaaggaaac ccgtttgaca 1800 aaggtgtttt tcaagattga agagaaagaa atagaacatgctctgaaaga taaagcttta 1860 attggaaata tcttagctac tggattgcct taaattcttctcttacaccc cacccgaaag 1920 tattcagctg gcatttagag agctacagtc ttcattttagtgctttacac attcgggcct 1980 gaaaacaaat atgacctttt ttacttgaag ccaatgaattttaatctata gattctttaa 2040 tattagcaca gaataatatc tttgggtctt actatttttacccataaaag tgaccaggta 2100 gacccttttt aattacattc actacttcta ccacttgtgtatctctagcc aatgtgcttg 2160 caagtgtaca gatctgtgta gaggaatgtg tgtatatttacctcttcgtt tgctcaaaca 2220 tgagtgggta tttttttgtt tgtttttttt gttgttgttgtttttgaggc gcgtctcacc 2280 ctgttgccca ggctggagtg caatggcgcg ttctctgctcactacagcac ccgcttccca 2340 ggttgaagtg attctcttgc ctcagcctcc cgagtagctgggattacagg tgcccaccac 2400 cgcgcccagc taatttttta atttttagta gagacagggttttaccatgt tggccaggct 2460 ggtcttgaac tcctgaccct caagtgatct gcccaccttggcctccctaa gtgctgggat 2520 tataggcgtg agccaccatg ctcagccatt aaggtattttgttaagaact ttaagtttag 2580 ggtaagaaga atgaaaatga tccagaaaaa tgcaagcaagtccacatgga gatttggagg 2640 acactggtta aag 2653 48 1618 DNA Homo sapiens48 atgtcccggc cgcagcttcg acgctggcgc ctcgtctcta gcccgccgag cggcgtcccg 60ggtctagcgc tgctggcgct gctggcgctg ctggcgctgc ggctcgcggc cgggaccgac 120tgcccatgcc cggagcctga gctctgccgc ccgattcgcc accatccaga tttcgaggtc 180tttgtgtttg atgttggaca gaaaacttgg aaatcttatg attggtcaca gattacaact 240gtggcaacat ttggaaaata tgactcagaa cttatgtgct acgctcattc aaaaggagcc 300agagtagtac ttaaaggaga tgtatcctta aaggatatca ttgatcctgc tttcagagca 360tcctggatag ctcaaaaact taatttggcc aaaacacaat atatggatgg aattaatata 420gatatagagc aagaagttaa ttgtttatca cctgaatatg atgcattaac tgctttagtc 480aaagaaacta cagactcttt ccatcgtgaa attgagggat cacaggtaac ctttgatgta 540gcttggtctc caaagaacat agacagaaga tgctataatt atactggaat cgcagatgct 600tgtgacttcc tctttgtgat gtcttatgat gaacaaagtc agatctggtc agaatgtatt 660gcagcagcca atgctcccta taatcagaca ttaactggat ataatgacta catcaagatg 720agcattaatc ctaagaaact tgtaatgggt gttccttggt atggttatga ttatacctgc 780ctgaatctgt ctgaggatca tgtttgtacc attgcaaaag tccctttccg gggggctcct 840tgtagtgacg ctgcaggacg tcaggtgccc tacaaaacga tcatgaagca aataaatagt 900tctatttctg gaaacctatg ggataaagat cagcgggctc cttattataa ctataaagat 960cctgctggcc actttcatca agtatggtat gataaccctc agagtatttc tttaaaggca 1020acatatatac aaaactatcg cttacggggc attggcatgt ggaatgcaaa ctgtcttgac 1080tactctggag atgctgtagc caaacagcaa actgaagaaa tgtgggaagt cttaaagcca 1140aagctgttac agagatgaac atcttttgtc aaaccattaa gagttagaaa gatgatctgt 1200atcaacagat ctagtttctt gcatttttat tatgttgcta tatacttttg ttatccgtat 1260actaaaaaaa aagaataaat aaatgttttg attgtttgaa tttgaaaaat acacacgaat 1320gtcctcagta tccaggaaca taaaggcaag aagcaagtca acttacctat taaatattcc 1380tctattagat gtttcaacac tataatttaa ttgggaaaaa ttgctttcag aattttatta 1440tgccatattt cccttcatta tagtaaaata tatgctcacg aatcaatgct gatttttaaa 1500atatgtataa tctgaagtgg aaattgtttg cttagagttt ttaaaaacct agtctttgaa 1560aagcagtttg tgctatactt ttcccccaac cctccaataa atcttaaatt taaaacct 1618 494814 DNA Homo sapiens 49 ggcggcggga gccctggaac ggagcttcgt ggagctaagcggagctgagc gcgaaaggcc 60 gaggcacttt cgggaattca cagtctgcag cattgggactgcaaatgccg tggctggcgc 120 cgtaaaatac agtgaaagcg cgggaggctt ttactacgtggagagtggca agttgttctc 180 cgtaaccaga aacaggttca ttcattggaa gacctctggagatacattgg agctgatgga 240 ggagtcactg gacataaatc tgttgaataa tgccattcgcctaaaattcc aaaattgcag 300 tgttttacct ggaggggttt atgtctctga gactcagaatcgtgtgataa tcttgatgtt 360 aaccaatcaa acagtgcaca ggttactttt accacacccctcccggatgt ataggagtga 420 gttggtagtt gacagtcaga tgcagtcaat attcactgacattggaaaag ttgatttcac 480 agatccttgc aactatcagt taattccagc agtacctggaatatctccta attccaccgc 540 ctctacagcc tggctcagca gtgatgggga ggccctgtttgccttaccat gtgcttctgg 600 gggaatcttt gttcttaagc tacctcctta tgacatacctggtatggtgt cagtcgtgga 660 actgaaacag agttcagtaa tgcaacgatt gcttacaggctggatgccaa cagctatcag 720 gggtgaccag tcgccttcag atcgtcccct cagtcttgctgttcattgtg tggagcatga 780 tgccttcatc tttgctttgt gtcaggatca taaactacgaatgtggtctt acaaggagca 840 aatgtgccta atggtagctg acatgctgga gtatgtccctgtgaagaaag accttcggct 900 tactgctgga actggacaca aattacggct tgcttattcccccaccatgg gactctacct 960 ggggatatac atgcatgcac caaaacgagg acagttctgcattttccagt tggtgagcac 1020 tgagagtaat cgctatagtc tcgatcatat ttcttcactgttcacttctc aggagacact 1080 gattgacttt gccttaactt ccacggatat ctgggccctgtggcatgatg ctgagaacca 1140 aacagtagtg aaatacatca actttgaaca taatgttgcaggtcagtgga atccagtttt 1200 tatgcagcct ctgccagagg aagagattgt catcagagatgatcaagacc ccagagagat 1260 gtatctgcaa agtcttttta caccaggaca attcacaaatgaagctttat gtaaggcttt 1320 acagattttc tgccgaggaa ctgagaggaa tttggatctttcctggagtg aactgaagaa 1380 agaagttact ttagctgttg aaaatgagct tcaaggaagtgtaacagagt atgaattctc 1440 ccaggaggag tttcgaaatt tacaacaaga attctggtgcaagttctatg cctgttgtct 1500 tcagtatcaa gaagccctct ctcaccctct tgccctacatttgaatccac acacaaacat 1560 ggtgtgcctg ctgaaaaaag ggtacctgtc tttccttattccctcatcct tagtggatca 1620 tttgtatctc ctgccttatg agaacctttt gacagaagatgagacaacca tatctgatga 1680 tgtggatatc gctcgggatg tcatatgtct tataaaatgcctccggctga ttgaagagtc 1740 agtaactgtg gatatgtcag ttataatgga aatgagttgttataacctac agtctccgga 1800 aaaggctgca gagcagattc tggaagatat gatcactattgatgtagaaa atgtgatgga 1860 ggatatttgt agtaaactgc aagagattag gaacccaatccatgcaattg gactacttat 1920 acgggaaatg gattatgaaa cagaagtgga aatggaaaagggattcaatc cagctcagcc 1980 tttgaatatt cgaatgaatc ttacccagct ctatggtagtaacacagcag ggtatattgt 2040 gtgcagaggg gtgcataaaa tcgccagtac tcgtttcctgatctgcagag atcttttgat 2100 cttacagcag ctgttaatga ggcttggaga tgctgtgatttggggaactg gtcagctctt 2160 tcaagctcag caagacctac tacatcgaac agctcccctactcttatctt attacctcat 2220 taaatgggga agtgagtgct tggcaactga tgttccacttgacacactgg agtctaatct 2280 ccaacactta tcagtactgg aattaacaga ctctggtgctttaatggcaa ataggtttgt 2340 atctagtcct cagactattg tggagttatt cttccaagaagttgcaagaa aacacattat 2400 atctcacctc ttctctcagc caaaggcacc tctgagccaaactggattga attggcctga 2460 aatgattact gcaattacca gttatttatt gcagcttttatggcctagca atcctggttg 2520 tctctttcta gaatgtttga tgggaaattg ccaatatgtacaattgcagg attatattca 2580 actgctacat ccctggtgtc aagtcaatgt tggttcctgtcgatttatgc tgggaaggtg 2640 ttacctagtt acaggagaag gacagaaggc tctggaatgtttttgtcagg cagcatctga 2700 agtaggcaaa gaggaattct tggatcgctt gattcgctcagaggatgggg agatcgtgtc 2760 tacccccagg ctgcagtatt atgacaaggt tttacgactactagatgtca ttggtttgcc 2820 tgaactggtt attcagttgg ctacatcagc cataactgaagcaagtgatg actggaaaag 2880 tcaggctact ctaaggacat gtattttcaa acatcatttggatttgggtc acaatagcca 2940 agcatatgaa gccttaaccc aaattcctga ttccagcaggcaattagatt gtttacggca 3000 gttggtggta gttctttgtg aacgctcaca gctacaggatcttgtagagt ttccctatgt 3060 gaatctgcat aatgaggttg tgggaataat tgagtcacgtgctagagctg tggaccttat 3120 gactcacaat tactatgaac ttctgtatgc ctttcacatctatcgccaca attaccgcaa 3180 ggctggcaca gtgatgtttg agtatggaat gcggcttggcagagaagttc gaactctccg 3240 gggacttgag aaacaaggca actgttatct ggctgctctcaattgtttac gacttattcg 3300 tccagaatat gcgtggattg tgcagccagt gtctggtgcagtgtatgatc gccctggagc 3360 atcccctaag aggaatcatg atggagaatg cacagctgcccccacaaatc gacaaattga 3420 aatcctggaa ctggaagatc tggagaaaga gtgttccttggctcgcatcc gcctcacttt 3480 ggctcagcat gatccatcag cggttgcagt tgctggaagttcatcagcag aggaaatggt 3540 cactctcttg gttcaggcgg gcctctttga cactgccatatcactctgtc agacttttaa 3600 gcttccctta acgccagtct ttgaagggct tgccttcaaatgcatcaaat tgcaatttgg 3660 aggagaggca gcacaagcag aagcctgggc ctggctagcagccaatcagc tctcatctgt 3720 catcactact aaggagtcta gtgctacaga tgaagcatggcgactattat ccacttacct 3780 ggagaggtac aaagtccaga ataacttgta tcaccactgtgtaatcaaca agctcttgtc 3840 tcatggagtg cctctgccta attggcttat aaacagtcacaacatcgcac tgtcccaaaa 3900 agttgataag gcaacacggg atttattata tcgtcggaccttgtgatttg gattgtcacc 3960 tagcctttgt aaccgcttgg tgcctcttag gacttaagactaccctacag gaaccctgta 4020 ctcaaggccg atttttgtaa ctgtaaatga tgtgtacaacattcaagtct gcattctgca 4080 caagatagga gggcggaaga gtcagaggac cctgtgcttgctggtggtgc taacacaatt 4140 tctggtgttc aaccttggtc tcaaatagct gcttttgtatatgattcacg agctttttta 4200 gagtttatat ttttttaaac taccgaagac attcattatctgcaaattaa gactcacctt 4260 cactttccaa aatagctgag ggttgttggc ttgttgtagctgaccaccaa aagcagtcac 4320 tgcaaatctt ttaattcttc cctatcacct tttgtattttaatgcaatta ttttggtcca 4380 gaactgacct gtattttctg tattgtacac aaaagctaataattttgtgt actttttatt 4440 tattttggag gttttatatg atcttcaatt gagtattaaataatttgcct agattaagcc 4500 taaaatgatg accagctaat taaagaagat attttgaatctggttctgag ctaaagttga 4560 gtaaattctt agctaagaaa aaattggaaa tccatcatctatattagcaa cagattctca 4620 gagtaaattg ttaacttcta tgatttatga taatcaagctggacttgatc atacaagtta 4680 gtctcataat gtattggacc aaaatgtaaa cttcattggtcagatttaga agcattcatg 4740 ctcacaagtt ttgggaaagt gaaaaataat aaaatcatcttggattttat tctgtatatt 4800 aaaatttatc tttt 4814 50 6493 DNA Homo sapiensmisc_feature (1)..(6493) n = a, c, g or t 50 gaattcaagt cttgttcctgcacattccac cctggagaaa tctggggcaa gtgactgttc 60 cccgggcctt agcttctcctgtcactggga catcacaaca gcacctacct tagggcaact 120 caggccaggg aagttggtgctgcctcacct cccaatgtgc gtcctcctgg gcctggagcc 180 tcagggcctc tggaaggaggaagtgagcgc ctctgggcag gattcctggg aggcctggga 240 gagcaaggga agcgccaagagctgagcaga gttctgggac tgatccatgg ccctttctct 300 ctcacctttc aggaggtgggccccctccac ccccagcact tcccacctgg tcggtcccga 360 acggcccctc cccggaggaggtggagcagc agaaaaggtg gggctgggcc ctgggtgggg 420 aaccttagcc gctgccagagttccatatgt tctggaaccc ttgactccta gagttcagaa 480 cccagccaac ttgcagttttcagaatgttc aagaaacttc tgacactcag agttgcagaa 540 cctcctggtc cctgcagattcctggaaatc agaatatggt ggttgaaaga atcttgtggc 600 tgggcgtggt ggctcacgcctgtaatccca gcactctggg aggccgaggc gggcagatcg 660 cctgaggtca ggagtttgagaccagcctgg ccaacatggc gaaatcccgt ctctactgaa 720 gataacaaaa attagccggtcatggtggcg cccgtgcctg taatcccagc tcggcaggcc 780 gaggcaggag aatcgcttgaacccgggagg cagaggttgc agtgagccaa gatcgagcca 840 ctgcactcca gcctgggtgacagagtctca aaaaaaaaaa aagaaaagaa agaatcttgg 900 gcattttgta attcggtgttcctgacagtt tagtgactgg gatctcgcat cctgatctct 960 ccctgtcgct gccctgccctccattccccc tactctcacc cagccccctt cttggttccc 1020 taggggagga aggcttgggtgagtattagg agccagccac cctggagacc tctgagagag 1080 aggacggagg tcgctggccccttcgctggc catccttagg accctgattg acggcagctc 1140 tctcgcctcc ccccacaggcagcagcccgg cccgtcggag cacatagagc gccgggtctc 1200 caatgcaggt gatgctcagatagcttcggg agttgggagg gggcctccct ggaggaagtg 1260 gccagccagc tggacagtgaagaatgaggc ttctctctct cagctgcccc cttttctgtg 1320 tttgtttcag gaggcccacctgctcccccc gctgggggtc cacccccacc accaggacct 1380 ccccctcctc caggtccccccccaccccca ggtttgcccc cttcnggggt cccagctgca 1440 gcgcacggag cagggggaggaccaccccct gcaccccctc tcccggcagc acagggccct 1500 ggtggtgggg gagctggggccccaggcctg gccgcagcta ttgctggagc caaactcagg 1560 aaagtcagca aggtgaggggccgggagagg tgggcagggg gcaacagggc ttttatgggg 1620 gatgaggcca gggctgccggcggtgtcatt gggctggaag gccaaaaggc ctgcccctaa 1680 agctcctgcc ccttttaaatttctccagca ggaggaggcc tcaggggggc ccacagcccc 1740 caaagctgag agtggtcgaagcggaggtgg gggactcatg gaagagatga acgccatgct 1800 ggcccggagg tgagcctgagcctggacccc caagtcacct ggagttccag ttcagtaggg 1860 cccagtcaga ggagggctccaattcctgtt tagtttgttt cttttggtga atgttccccc 1920 tttgataacc aggtttgggatataatggtg gggtttgtca tgaaatgcct gaggcttgca 1980 accacctagg tagcctgtagatgttctaaa acccagaatt ctagaaccgt aggagatctt 2040 tcctcagaat tctgggaactcaggttcctg caatctcagt gttccaacac agcaccgctc 2100 caccctcgga atcttactgttccctaatat aagaatcata gaacctcctc caccctgatt 2160 ctagaaccac aatctcttgaattttttttt tttttttttt tttttttttg agatggagtc 2220 ttgctctgtc acccaggctggagtgcagtg gtatgatctc ggtccactgc aacctccgcc 2280 tcctgggttc aggcagttcttctgcctcag cctcctaagt agccgggatt acaggcatga 2340 gtcaccacac ccggctaatttttgtatttt tagtagacac aggatttcac catgttggcc 2400 aggctggtct tgaactcttgacctcaagag atccacctgc ttcagcctct caaagtgttg 2460 gcattacagg ccactgcgcccagcacaatc tcttgaattt ctaaaactag agtttcctta 2520 ggttttcgga gttccagaattctatgcgct aggatctaca tttctagaac tcccctcaga 2580 aggggatggg ttgggtgacggaagcacgtg tttttgcttt tctctcctgc agaaggaaag 2640 ccacgcaagt tggggagaaaacccccaagg atgaatctgc caatgtaagt cagggactct 2700 tcttgcccta catctcttaggccgtaccat gagggtaggg atagtgggat gtgtggggtt 2760 tgaacctgaa agaggaaatgggcagaggtg tggcaggggc tggctcatgg cagttttatt 2820 tcctaccagc aggaggagccagaggccaga gtcccggccc agagtggtga gtagagtgcc 2880 cagtccagcc acaggaactacaaatcccag aatactctgt tctcacatgt taagcaccct 2940 tataggagag tcagggcgaatggtgctggg gattgtagtc tcctgagatg gggctttgat 3000 caggggctga tgaggttgggggagtaagat tgattggggg gcagtctttt gtccctgatc 3060 tttctgattt cttgcctatccccagaatct gtgcggagac cctgggagaa gaacagcaca 3120 accttgccaa ggtaggccatcggtcctggg gcccttgggg aggtaaaggc gggcagatcg 3180 cttgagccca ggaggtcaagaccagcctgg gcaacatggc gacaccccat ctctacaaaa 3240 attagccagg cgtggtagcacttacctgtg gtcccagcta ctcaggaggc tgaggtggga 3300 ggattacttg agcccaggaagttgaggcct cagcgagcca tcatcatgcc tgcactccag 3360 cctgagaaat agaatgtgactgtctcaaaa caaaacacaa caaaccaaaa ccaaaaaaaa 3420 aaaaaactgg ggccccaaaaatacttggac ttgcccaatt tataaggcag agctcaatgt 3480 gatccctgga ataggaggcggggaagcagg tcctctctct aatctcattg ctgtcccaaa 3540 ccacaccaac tcccccaggatgaagtcgtc ttcttcggtg accacttccg agacccaacc 3600 ctgcacgccc agctccagtgattactcgga cctacagagg gtgaaacagg taacttgggg 3660 gggaagttgg ggaccacagcaagagagatc taggtctggc ccctgccact ggcatgccgt 3720 atgatcctag ataacatctcagaaacctca ggtttccaat ctgacaaatg gagaaactgg 3780 attgggtcaa ggatgaccgagactccacac ccccttttct ggcacctgtg acagacatta 3840 ttaatctatc accgcgctcattccagatga gtgccttgaa ttctttccgc acattgaccc 3900 agctgtccat caccaattggagttggcagg aggctggaat gcgcttgcca accttggtac 3960 tggatgttct ccagtacttttccggctcca aggatccaga attctcccct agaatcctcc 4020 agtcactctg cgaccttgacagcgatgtca tggtgtcgat gtaggggtag gtctcaaacc 4080 tactccccct ggcttttccatcaacaagaa agaggggact ctggcagggc acggtggctc 4140 atgtgtgtaa tttcagcacattgcgaggct gaggtgggag cattgcttga ggccaggagt 4200 ttgagaccag cctggggcaacatcgggaga cccccatctc taaaaataac ttttaaaagt 4260 tacctgagaa ggccaggtgcggtggctcat gcctgtaatc ccagcacttt gggaggccga 4320 ggtgggtgga tcacctgaggtcaggggttc aagaccagcc tggccaacat ggtgaaaccc 4380 atcgctacta aaaatacaaaaattaggctg ggaatggtgg ctcacagcca taatcccagc 4440 agtttggaag gctgatggggacggatcacg tgaagtcaaa agttcgagac cagcctggcc 4500 aacatggcga aaccctgtctctactaaaaa tacaaaaatt agctgggcct tgtggggggc 4560 acctgtaatc cagttatttgggcggctgag gcaggagaat cgcttgaacc cgggagccag 4620 agattgcagt cagccgagattgggccactg cactgcagtc tgggtgacag ggagactctg 4680 tttcaaaaaa aaaaagaaaaagaaaaagtt acctgattgt ggcggcaggt gactgtggtc 4740 ccagctactt gggaggctaaggcaggagga ttacctgagc ctgggaagtt gaggctgcaa 4800 tgagctgtga tcatgccattgcaccctagc ctaggcaaca gagcaaggtt ccttctcaaa 4860 aaataaaaga agggggattcattcctgcaa gtcccggtac ccctcctgat tagttttacc 4920 ccattaattt taggagcttctggaagaggt gaagaaggaa ttgcagaaag tgaaagagga 4980 aatcattgaa ggtgaggtggtttgctttgg ttttgttctt aaacatttac ttattttgga 5040 ggcatcatgt ccctgggcaagagccctgtt ttggaaggga ggaggcagag actctgcccc 5100 tgacctctgc tccttgtttccttccagcct tcgtccagga gctgaggaag cggggttctc 5160 cctgaccaca gggacccagaagacccgctt ctcctttccg cacacccggc ctgtcaccct 5220 gctttccctg cctctacttgacttggaatt ggctgaagac tacacaggaa tgcatcgttc 5280 ccactcccca tcccacttggaaaactccaa gggggtgtgg cttccctgct cacacccaca 5340 ctggctgctg attggctggggaggcccccg cccttttctc cctttggtcc ttcccctctg 5400 ccatcccctt ggggccggtccctctgctgg ggatgcacca atgaacccca caggaagggg 5460 gaaggaagga gggaatttcacattcccttg ttctagattc actttaacgc ttaatgcctt 5520 caaagttttg gtttttttaagaaaaaaaaa tatatatata tttgggtttt gggggaaaag 5580 ggaaattttt ttttctctttggttttgata aaatgggatg tgggagtttt taaatgctat 5640 agccctgggc ttgccccatttggggcagct atttaagggg aggggatgtc tcaccgggct 5700 gggggtgaga catccccccaccccagggac tccccttccc tctggctcct tccccttttc 5760 tatgaggaaa taagatgctgtaactttttg gaacctcagt tttttgattt tttatttggg 5820 taggttttgg ggtccaggccatttttttta ccccttggag gaaataagat gagggagaaa 5880 ggaaaagggg aggaaacttctcccctccca ccttcacctt tagcttcttg aaaatgggcc 5940 cctgcagaat aaatctgccagtttttataa atgctaagat ctctggagtg atttgaaggc 6000 ctgttctgat ggggatggaggtgtgctcgg cccccggtgc ccctccagga agatttggtc 6060 ctctgctgag aacccctgcctcctcccagg aatccacctt cccttcatct tccttcccac 6120 cctgcatatt gcgcctgctcactcatcctc aggcccgcag ccaggatgat ctctgccccc 6180 tccagcctcc ctccccatgccccttaggag gccacttcct ccccatccca ccctgccctt 6240 caccacccta ggggaggccagaagcagcct cactttgtgt agccttgggc aagtccattt 6300 gcttacctca ggcctcagtttctgatttgg gaaagggctc ataagatgat tctctgcccc 6360 cactctacca ctctcccagcttctttcctc tttttttttt tttttttttt ttaatgagtt 6420 ggggtcttgc tctttcacccagtctggagt gtagtggcag gatcacagct cactgcagcc 6480 ttgaactcct ggg 6493 515629 DNA Homo sapiens 51 gcgcgaccgt cccgggggtg gggccgggcg cagcggcgagaggaggcgaa ggtggctgcg 60 gtagcagcag cgcggcagcc tcggacccag cccggagcgcagggcggccg ctgcaggtcc 120 ccgctcccct ccccgtgcgt ccgcccatgg ccgccgccgggcagctgtgc ttgctctacc 180 tgtcggcggg gctcctgtcc cggctcggcg cagccttcaacttggacact cgggaggaca 240 acgtgatccg gaaatatgga gaccccggga gcctcttcggcttctcgctg gccatgcact 300 ggcaactgca gcccgaggac aagcggctgt tgctcgtgggggccccgcgc ggagaagcgc 360 ttccactgca gagagccaac agaacgggag ggctgtacagctgcgacatc accgcccggg 420 ggccatgcac gcggatcgag tttgataacg atgctgaccccacgtcagaa agcaaggaag 480 atcagtggat gggggtcacc gtccagagcc aaggtccagggggcaaggtc gtgacatgtg 540 ctcaccgata tgaaaaaagg cagcatgtta atacgaagcaggaatcccga gacatctttg 600 ggcggtgtta tgtcctgagt cagaatctca ggattgaagacgatatggat gggggagatt 660 ggagcttttg tgatgggcga ttgagaggcc atgagaaatttggctcttgc cagcaaggtg 720 tagcagctac ttttactaaa gactttcatt acattgtatttggagccccg ggtacttata 780 actggaaagg gattgttcgt gtagagcaaa agaataacactttttttgac atgaacatct 840 ttgaagatgg gccttatgaa gttggtggag agactgagcatgatgaaagt ctcgttcctg 900 ttcctgctaa cagttactta ggtttttctt tggactcagggaaaggtatt gtttctaaag 960 atgagatcac ttttgtatct ggtgctccca gagccaatcacagtggagcc gtggttttgc 1020 tgaagagaga catgaagtct gcacatctcc tccctgagcacatattcgat ggagaaggtc 1080 tggcctcttc atttggctat gatgtggcgg tggtggacctcaacaaggat gggtggcaag 1140 atatagttat tggagcccca cagtattttg atagagatggagaagttgga ggtgcagtgt 1200 atgtctacat gaaccagcaa ggcagatgga ataatgtgaagccaattcgt cttaatggaa 1260 ccaaagattc tatgtttggc attgcagtaa aaaatattggagatattaat caagatggct 1320 acccagatat tgcagttgga gctccgtatg atgacttgggaaaggttttt atctatcatg 1380 gatctgcaaa tggaataaat accaaaccaa cacaggttctcaagggtata tcaccttatt 1440 ttggatattc aattgctgga aacatggacc ttgatcgaaattcctaccct gatgttgctg 1500 ttggttccct ctcagattca gtaactattt tcagatcccggcctgtgatt aatattcaga 1560 aaaccatcac agtaactcct aacagaattg acctccgccagaaaacagcg tgtggggcgc 1620 ctagtgggat atgcctccag gttaaatcct gttttgaatatactgctaac cccgctggtt 1680 ataatccttc aatatcaatt gtgggcacac ttgaagctgaaaaagaaaga agaaaatctg 1740 ggctatcctc aagagttcag tttcgaaacc aaggttctgagcccaaatat actcaagaac 1800 taactctgaa gaggcagaaa cagaaagtgt gcatggaggaaaccctgtgg ctacaggata 1860 atatcagaga taaactgcgt cccattccca taactgcctcagtggagatc caagagccaa 1920 gctctcgtag gcgagtgaat tcacttccag aagttcttccaattctgaat tcagatgaac 1980 ccaagacagc tcatattgat gttcacttct taaaagagggatgtggagac gacaatgtat 2040 gtaacagcaa ccttaaacta gaatataaat tttgcacccgagaaggaaat caagacaaat 2100 tttcttattt accaattcaa aaaggtgtac cagaactagttctaaaagat cagaaggata 2160 ttgctttaga aataacagtg acaaacagcc cttccaacccaaggaatccc acaaaagatg 2220 gcgatgacgc ccatgaggct aaactgattg caacgtttccagacacttta acctattctg 2280 catatagaga actgagggct ttccctgaga aacagttgagttgtgttgcc aaccagaatg 2340 gctcgcaagc tgactgtgag ctcggaaatc cttttaaaagaaattcaaat gtcacttttt 2400 atttggtttt aagtacaact gaagtcacct ttgacaccccatatctggat attaatctga 2460 agttagaaac aacaagcaat caagataatt tggctccaattacagctaaa gcaaaagtgg 2520 ttattgaact gcttttatcg gtctcgggag ttgctaaaccttcccaggtg tattttggag 2580 gtacagttgt tggcgagcaa gctatgaaat ctgaagatgaagtgggaagt ttaatagagt 2640 atgaattcag ggtaataaac ttaggtaaac ctcttacaaacctcggcaca gcaaccttga 2700 acattcagtg gccaaaagaa attagcaatg ggaaatggttgctttatttg gtgaaagtag 2760 aatccaaagg attggaaaag gtaacttgtg agccacaaaaggagataaac tccctgaacc 2820 taacggagtc tcacaactca agaaagaaac gggaaattactgaaaaacag atagatgata 2880 acagaaaatt ttctttattt gctgaaagaa aataccagactcttaactgt agcgtgaacg 2940 tgaactgtgt gaacatcaga tgcccgctgc gggggctggacagcaaggcg tctcttattt 3000 tgcgctcgag gttatggaac agcacatttc tagaggaatattccaaactg aactacttgg 3060 acattctcat gcgagccttc attgatgtga ctgctgctgccgaaaatatc aggctgccaa 3120 atgcaggcac tcaggttcga gtgactgtgt ttccctcaaagactgtagct cagtattcgg 3180 gagtaccttg gtggatcatc ctagtggcta ttctcgctgggatcttgatg cttgctttat 3240 tagtgtttat actatggaag tgtggtttct tcaagagaaataagaaagat cattatgatg 3300 ccacatatca caaggctgag atccatgctc agccatctgataaagagagg cttacttctg 3360 atgcatagta ttgatctact tctgtaattg tgtggattctttaaacgctc taggtacgat 3420 gacagtgttc cccgatacca tgctgtaagg atccggaaagaagagcgaga gatcaaagat 3480 gaaaagtata ttgataacct tgaaaaaaaa cagtggatcacaaagtggaa cagaaatgaa 3540 agctactcat agcgggggcc taaaaaaaaa aaagcttcacagtacccaaa ctgctttttc 3600 caactcagaa attcaatttg gatttaaaag cctgctcaatccctgaggac tgatttcaga 3660 gtgactacac acagtacgaa cctacagttt taactgtggatattgttacg tagcctaagg 3720 ctcctgtttt gcacagccaa atttaaaact gttggaatggatttttcttt aactgccgta 3780 atttaacttt ctgggttgcc tttgtttttg gcgtggctgacttacatcat gtgttgggga 3840 agggcctgcc cagttgcact caggtgacat cctccagatagtgtagctga ggaggcacct 3900 acactcacct gcactaacag agtggccgtc ctaacctcgggcctgctgcg cagacgtcca 3960 tcacgttagc tgtcccacat cacaagacta tgccattggggtagttgtgt ttcaacggaa 4020 agtgctgtct taaactaaat gtgcaataga aggtgatgttgccatcctac cgtcttttcc 4080 tgtttcctag ctgtgtgaat acctgctcac gtcaaatgcatacaagtttc attctccctt 4140 tcactaaaaa cacacaggtg caacagactt gaatgctagttatacttatt tgtatatggt 4200 atttattttt tcttttcttt acaaaccatt ttgttattgactaacaggcc aaagagtctc 4260 cagtttaccc ttcaggttgg tttaatcaat cagaattagaattagagcat gggagggtca 4320 tcactatgac ctaaattatt tactgcaaaa agaaaatctttataaatgta ccagagagag 4380 ttgttttaat aacttatcta taaactataa cctctccttcatgacagcct ccaccccaca 4440 acccaaaagg tttaagaaat agaattataa ctgtaaagatgtttatttca ggcattggat 4500 attttttact ttagaagcct gcataatgtt tctggatttacatactgtaa cattcaggaa 4560 ttcttggaga agatgggttt attcactgaa ctctagtgcggtttactcac tgctgcaaat 4620 actgtatatt caggacttga aagaaatggt gaatgcctatggaactagtg gatccaaact 4680 gatccagtat aagactactg aatctgctac caaaacagttaatcagtgag tcgagtgttc 4740 tattttttgt tttgtttcct cccctatctg tattcccaaaaattactttg gggctaattt 4800 aacaagaact ttaaattgtg ttttaattgt aaaaatggcagggggtggaa ttattactct 4860 atacattcaa cagagactga atagatatga aagctgattttttttaatta ccatgcttca 4920 caatgttaag ttatatgggg agcaacagca aacaggtgctaatttgtttt ggatatagta 4980 taagcagtgt ctgtgttttg aaagaataga acacagtttgtagtgccact gttgttttgg 5040 ggggggcttt ttttcttttt ccggaaaatc cttaaaccttaagatactaa ggacgttgtt 5100 ttggttgtac ttggaattct tagtcacaaa atatattttgtttacaaaaa tttctgtaaa 5160 acaggttata acagtgttta aagtctcagt ttcttgcttggggaacttgt gtccctaatg 5220 tgttagattg ctagattgct aaggagctga tacttgacagttttttagac ctgtgttact 5280 aaaaaaaaga tgaatgtcgg aaaagggtgt tgggagggtggtcaacaaag aaacaaagat 5340 gttatggtgt ttagacttat ggttgttaaa aatgtcatctcaagtcaagt cactggtctg 5400 tttgcatttg atacattttt gtactaacta gcattgtaaaattatttcat gattagaaat 5460 tacctgtgga tatttgtata aaagtgtgaa ataaattttttataaaagtg ttcattgttt 5520 cgtaacacag cattgtatat gtgaagcaaa ctctaaaattataaatgaca acctgaatta 5580 tctatttcat caaaaaaaaa aaaaaaaaaa actttatgggcacaactgg 5629 52 4994 DNA Homo sapiens 52 ccgcagcgct cggctggctgcagcggcacc gcgggttgcg cggccgggga tgctccagcg 60 ggcgcgatgg cccccgccatgcagccggcc gagatccaat ttgcccagcg gctggcgtcc 120 agcgagaagg gcatccgggaccgagcggtg aagaagctgc gccagtacat cagcgtgaag 180 acgcagaggg agacaggaggtttcagtcag gaagaacttc tacaggaaga gctcgccaac 240 accattgcac agctagtccatgctgttaac aactcagcgg ctcaacacct gttcattcag 300 accttttggc aaaccatgaatcgagaatgg aaaggaatag acaggctacg cctggacaaa 360 tactatatgc tgattcgtctggtcctgagg cagtcctttg aagtcttgaa gcgaaatggc 420 tgggaagaaa gccgaatcaaggttttcttg gatgtcctga tgaaggaggt cctgtgtcct 480 gagagtcagt ctcctaatggagtgagattc cacttcattg atatttacct ggatgaactc 540 tccaaagtcg gggggaaggagcttttagca gatcagaatc tcaagtttat cgatccattc 600 tgcaaaattg ctgcaaagacgaaggaccac accctggtac agaccatagc tcggggtgtc 660 ttcgaagcta tcgtagatcagtctcctttt gtgcctgaag agacgatgga ggaacagaag 720 acaaaagtgg gtgatggtgacctctctgct gaggagatac ctgaaaatga ggtatccttg 780 agaagagctg tcagtaaaaagaagacagca ctgggcaaaa accattccag aaaagatgga 840 ctcagtgatg aaagaggaagagatgactgt ggaacctttg aggacacagg gccccttctc 900 cagtttgact ataaggctgttgctgatcga ctcctggaaa tgaccagcag gaagaacacg 960 ccccacttca acaggaagcgcctctccaaa ctcatcaaga aattccaaga cctttctgaa 1020 ggaagcagta tatctcaactcagttttgcg gaggacattt ctgctgatga agatgaccaa 1080 atcctcagtc aaggaaagcataagaagaaa ggaaataaac ttttagagaa aactaacttg 1140 gaaaaggaga aaggaagcagagtcttttgt gtagaggaag aggacagtga aagcagtctt 1200 caaaagagaa gaaggaagaagaagaagaag caccacctgc agcctgaaaa tccaggccca 1260 gggggtgcag ccccgtccctggaacagaac cggggcaggg agcccgaggc ctctgggccg 1320 aaagccctga aggcacgtgtggccgagcca ggtgcagagg ccacgtccag cactggggag 1380 gagagtggct ccgagcatcctccagccgtc cccatgcaca ataaaaggaa acggccacgg 1440 aagaagagcc cgagggcccacagggaaatg ttggaatcag cagtgttgcc cccagaggac 1500 atgtctcaga gtggcccgagtggcagtcat cctcagggac ctagagggtc cccgacaggt 1560 ggagcccaac tcctaaaaaggaagcggaaa cttggagttg tgcccgtcaa tggcagtggc 1620 ctgtccacgc cggcctggcctccattgcag caggaaggcc ctcccacagg ccccgcagag 1680 ggggcgaaca gccacaccacgctgccccag cgcaggaggc tgcagaaaaa gaaggcaggg 1740 cccggcagcc tggagctctgtggcctgccc agccagaaaa cagcaagttt gaaaaagagg 1800 aagaaaatga gagtgatgtcaaacttggtg gagcacaacg gggtgctgga gtccgaagct 1860 gggcaacccc aggctctgggaagcagtggg acttgcagtt ccctgaagaa gcagaagctg 1920 agggcagaga gcgactttgtgaagtttgac acccccttct taccaaagcc cctgttcttc 1980 agaagagcca agagcagcactgccacccac cctccaggcc ctgccgtcca gctaaacaag 2040 acaccatcca gctccaagaaagtcaccttt gggctgaaca gaaacatgac tgccgaattc 2100 aagaagacag acaagagtatcttggtcagt cccacgggcc cttctcgagt ggccttcgac 2160 cctgaacaga agcccctccacggggtgctg aagaccccca ccagctcacc tgccagctca 2220 cccctggtgg ccaagaagcccctgaccacc acaccaagga gaaggcccag ggctatggat 2280 ttcttctgag gagcagcagagtcccttgta aaagactgct tttgtacaga atgcgctata 2340 aattatacct ttaagaatgtggggcctttt ttatgatttt gtaagttccc ataagttgtg 2400 tgcacgaggt tctgagagtgcccgcaggct gctgcgtcct ggcccctctg tagtggctgc 2460 gggcgtcttg gttgaatcttttgctacaaa ccatgtttgc gtttgagctc tccaggattt 2520 tacatttttg ggtaacctcagtgattccca ttggtgtagg aaatgagacc ctctctgaag 2580 ctgaggagag cacgttgatctgaactttaa atcaatcagt gctgctggca caatgaaagg 2640 tggaactgca cttgtgttgagctctcagtt ctgcggaatt tggtactcat taccgtattc 2700 gccgtactaa gttggtttctgttagtctta acagtctgtt ttcttttaaa agcatgtagg 2760 gcttcattgc catgttctgtgggtgtttgg caggttaccg atggggaaga ttcttgtcac 2820 agaatcagca ataccatagtttttctacat gtgctcagct gggggtgtgg acaggtaggg 2880 gtggggaaag aagaggctctgcgttctggg ggctttttct tctcctcccc ctacccggtt 2940 tccctccctg ttttcctacctctacggcaa gcccaaagtg tcttcccggg agcccagcgc 3000 agcccccggc tcttacccaggaccccgccc cgtgctgagc cttctgctga ggtccttgcg 3060 tggagcacac tcattcctccaagcccttgc gctcccgttt ctctctctct ccgtccacgt 3120 tccagccgag tcactgcctgcctgaccggc tccatggcag ctccccatct tccctagagg 3180 ctgcctgcgc atctggagcctgcgctccgg ctcagcgacc tttcctctca aatgcggaag 3240 cgtgcactta cagttcagaccgttctcctg taagttcatt acaaacacgg gcggaaggca 3300 ctcaggcttt cgttggagaaacagaaataa ggccttcttt tgagcagcga ttgctggatc 3360 attgatctgt ttgaggaagtgtctgacctg ggcctgagag ctggagaagg tgcagattca 3420 aagtgagcgg ctcctgaggagagccgccaa ggctgctcgc cttctccgtg gcttccgcag 3480 ctaccgtctg cacggtgagagggcacgggc acacggttcg ggctggcgtg cagctctccc 3540 agccagccac gctctgctcaggcctggaag tgaaagccgc ctccttcccg ttatgccccc 3600 catacaggag cctcggtttttcagcaaaac gcggccagtc cccttctcca ctgctgcctc 3660 ccagcagagg gccccaggatctccaaggtc ccagctatgg ctttggacaa cgtggcttcg 3720 gcccctgggg ttgcagagcttgcattgggt ttacctcggt ctcattcatt catggagcca 3780 agggtggggt ttcacctgcgaacatcagac tgacttgctg gcgtcaagag cagttgactc 3840 actgatgaag gccctggtgaggagaaagca ctctgttctt cgcctactct gtaatcgttt 3900 tgtcataatg agccatgaaaaaagtaatga acttgtgctg ttaatcgtca ctgtaatgag 3960 aagtcttacg tacaacatagctgtggtggc tgcgtggttt aatggctgca ttagatagga 4020 tcctcacatc ccattcagaaccaaaactga tacagtgaaa caattaaggt gagcaaatag 4080 ttttaacttt tcttttttttttaagtttca ttcttcctag aatatttttc taacaatttt 4140 tatttcagct ttaaagatgggtcatatagc caaacgggcc atataatcca acattgttga 4200 gatgtcttag gacatctaaggcaaaactgg cacatttgtt ctgcagacta ttgcaggaat 4260 gttttttcct agcatttctatattatctgt ccattctgag gaaccagtga atgtcctata 4320 aatgcacctc ctgtcaaaaccatgcctgag aggtcccggc tgggagtgac agggtgcttc 4380 ttagattcta ttggtccttctctcattctc cgaacttact cctttttatg ggtaagtcaa 4440 ctaggtttac agtcccttatttttaatgcc taagttttga cagcaggaag aaaacaattt 4500 tttaaaaatt ctcattacatagacgcacaa gaatatgtca cataaagaaa atgtgtttag 4560 aatactggtt ttctatttacgcatgatatt ttcctaagta aaattgccaa gtggacttgg 4620 aagtccagaa aggaaaataatttaaattaa tgctggtgat cttaacaata ttttgtaaaa 4680 tgatgcttcc cccttctccatggtctagtc aattttgtac aattaggtat ctgactttac 4740 aagtttgtta tcctttctaatttttactga actgaaagca caaagaagac tacacagaaa 4800 atctggaaac agttgcaggtgttgggagga agatgaaatc gagctgtctt ttaacttttg 4860 tatgtgtttt atcagaatttgctggactat gctggcaagg actttgttta cgatcaaatt 4920 gtactagtgt ctgcagggtttgtcagtact cgtcaaagcc aagtccaatt aaaaaaaaaa 4980 agtctttgcc ctcc 4994 531202 DNA Homo sapiens 53 ggcacgaggc gccatttgct gccgccgagc gtggacgcaggcggatctct gaagagctgg 60 gtcgccagcc tctcccgcgc acgttgcctg gcctccagcacctacttggt cccgcgcgct 120 ccctcgtgtc gcccctcgga gcagcagccg ccgcggtcgccgctacccgg aaagaagtca 180 gagacgccgc gagtcgccgc caccgccatg cccaagaataaaggtaaagg aggtaaaaac 240 agacgcaggg gtaagaatga gaatgaatct gaaaaaagagaactggtatt caaagaggat 300 gggcaggagt atgctcaggt aatcaaaatg ttgggaaatggacggctaga agcaatgtgt 360 ttcgatggtg taaagaggtt atgtcacatc agaggaaaattgagaaaaaa ggtttggata 420 aatacctcgg acattatttt ggttggtctc cgagactaccaggataacaa agctgatgta 480 attttaaaat acaatgcaga cgaagctaga agtctgaaggcatacggcga gcttccagag 540 catgctaaaa tcaatgaaac tgatacattt ggtcctggagatgatgatga aattcagttt 600 gatgacattg gagatgatga tgaagatatt gatgacatctaaattgaact caacatttta 660 cattccatct tttctgaaga ttgtcctaca atttggattttgatcatgac aaagaagatt 720 aaaatttcat tagcatgaat gcaatttgtt aaagcagactgatttgtttc taagatattt 780 ttggtttttt taaaactgat aataatgctg aattatcttaagtgagatgt taagcccact 840 ttgttctttt aatgtaatgg agcttatggg tagaagaccatgtctactaa ttacaaaaaa 900 aaaaaaaaac catgattgct gcttttccta ccacttccagtaagaaaatg ggtgttttga 960 agaaatcatt tgccttgtct cacggaatct gattaagccctggcctcttg atgtatagag 1020 tcatggatat tccagttacc tagatattcc cttgagattttgatacaatt tgagggaggc 1080 agaagtctgc agttgaagaa aaaaaataag tctgtttgtcatatttaagt agcctgtgcg 1140 tatttttata ctgattttga tatcatgttc ttttcatagtcgtattttgc caccgtaaac 1200 at 1202 54 1745 DNA Homo sapiens 54ctgctcgaga aggagctgga gcagagccag aaggaggcct cagaccttct ggagcagaac 60cggctcctgc aggaccagct gagggtggcc ctgggccggg agcagagcgc ccgtgagggc 120tacgtgctgc aggccacgtg cgagcgaggg tttgcagcaa tggaagaaac gcaccagaag 180attgaagatc tccagaggca gcaccagcgg gagctagaga aacttcgaga agagaaagac 240cgcctcctag ccgaggagac agcggccacc atctcagcca tcgaagccat gaagaacgcc 300caccgggagg aaatggagcg ggagctggag aagagccagc ggtcccagat cagcagcgtc 360aactcggatg ttgaggccct gcggcgccag tacctggagg agctgcagtc ggtgcagcgg 420gaactggagg tcctctcgga gcagtactcg cagaagtgcc tggagaatgc ccatctggcc 480caggcgctgg aggccgagcg gcaggccctg cggcagtgcc agcgtgagaa ccaggagctc 540aatgcccaca accaggagct gaacaaccgc ctggctgcag agatcacacg gttgcggacg 600ctgctgactg gggacggcgg tggggaggcc actgggtcac cccttgcaca gggcaaggat 660gcctatgaac tagaggtctt attgcgggta aaggaatcgg aaatacagta cctgaaacag 720gagattagct ccctcaagga tgagctgcag acggcactgc gggacaagaa gtacgcaagt 780gacaagtaca aagacatcta cacagagctc agcatcgcga aggctaaggc tgactgtgac 840atcagcaggt tgaaggagca gctcaaggct gcaacggaag cactggggga gaagtcccct 900gacagtgcca cggtgtccgg atatgatata atgaaatcta aaagcaaccc tgacttcttg 960aagaaagaca gatcctgtgt cacccggcaa ctcagaaaca tcaggtccaa gagtctgaag 1020gaaggcctga cggtgcaaga acggttgaag ctctttgaat ccagggactt gaagaaagac 1080taggtgtgtc ccatccaagt tgagcacgcg ccttccccag cttgcagcag cacaccccaa 1140gcgctgcttt tcacctgtac ctttgtttta ttattattat tattattgct gttgttgtca 1200tcgttaactg tgggcatgga atgcgtgagg ctggcttctg ggttgtccac accactctct 1260gctgtgttga cttcctgttg tcttcaacaa agcttttttc cgtggtattc taaaattagg 1320ccagcagtgg gggctgggag ggcatctgtg ttagtccttt cctggctgtg acccgccaca 1380ctcactgtca gtattaaggc ccagcagcct gttgataagc taccctgtct caccatgtgc 1440tggtgtggaa acggggccca gccagcacgc ctcaaggtag atggaatccc cactggtcag 1500agaaaaagct atgcggacac tccagcttgg cctgggtcac agcactgact cctcacccgc 1560tagtctggct gttaagagga gaaagtgcac tgccttccag cccaggagga ggacagcatt 1620ttgtatttgt tccactgatg cagcttagac ccacacccct gagagtcgtg gcaaaccttt 1680cacaacctgg aaaatgttga aagcaaccat tcctattttt gtttgttttt tattaaatct 1740tgcac 1745 55 976 DNA Homo sapiens 55 cccggaacct ggcgcaactc ctagagcggtccttggggag acgcgggtcc cagtcctgcg 60 gctcctactg gggagtgcgc tggtcggaagattgctggac tcgctgaaga gagactacgc 120 aggaaagccc cagccaccca tcaaatcagagagaaggaat ccaccttctt acgctatggc 180 aggtaagaaa gtactcattg tctatgcacaccaggaaccc aagtctttca acggatcctt 240 gaagaatgtg gctgtagatg aactgagcaggcagggctgc accgtcacag tgtctgattt 300 gtatgccatg aactttgagc cgagggccacagacaaagat atcactggta ctctttctaa 360 tcctgaggtt ttcaattatg gagtggaaacccacgaagcc tacaagcaaa ggtctctggc 420 tagcgacatc actgatgagc agaaaaaggttcgggaggct gacctagtga tatttcagtt 480 cccgctgtac tggttcagcg tgccggccatcctgaagggc tggatggata gggtgctgtg 540 ccagggcttt gcctttgaca tcccaggattctacgattcc ggtttgctcc agggtaaact 600 agcgctcctt tccgtaacca cgggaggcacggccgagatg tacacgaaga caggagtcaa 660 tggagattct cgatacttcc tgtggccactccagcatggc acattacact tctgtggatt 720 taaagtcctt gcccctcaga tcagctttgctcctgaaatt gcatccgaag aagaaagaaa 780 ggggatggtg gctgcgtggt cccagaggctgcagaccatc tggaaggaag agcccatccc 840 ctgcacagcc cactggcact tcgggcaataactctgtggc acgtgggcat cacgtaagca 900 gcacactagg aggcccaggc gcaggcaaagagaagatggt gctgtcatga aataaaatta 960 caacatagct acctgg 976 56 3394 DNAHomo sapiens 56 gtcccgagcg ccggcctgcg gagcgtagca gcccgggcca gacgccggaggagggcgcgc 60 aggccttggc cgagttcgcg gcgctgcacg gcccggcgct gcgcgcttcgggggtccccg 120 aacgttactg gggccgcctc ctgcacaagc tggagcacga ggttttcgacgctggggaag 180 tgtttgggat catgcaagtg gaggaggtag aagaggagga ggacgaggcagcccgggagg 240 tgcggaagca gcagcccaac ccggggaacg agctgtgcta caaggtcatcgtgaccaggg 300 agagcgggct ccaggcagcc caccccaaca gcatcttcct catcgaccacgcctggacgt 360 gccgtgtgga gcacgcgcgc cagcagctgc agcaggtgcc cgggctgctgcaccgcatgg 420 ccaacctgat gggcattgag ttccacggtg agctgcccag tacagaggctgtggccctgg 480 tgctggagga gatgtggaag ttcaaccaga cctaccagct ggcccatgggacagctgagg 540 agaagatgcc ggtgtggtat atcatggacg agttcggttc gcggatccagcacgcggacg 600 tgcccagctt cgccacggca cccttcttct acatgccgca gcaggtggcctacacgctgc 660 tgtggcccct gagggacctg gacactggcg aggaggtgac ccgagactttgcctacggag 720 agacggaccc cctgatccgg aagtgcatgc tgctgccctg ggcccccaccgacatgctgg 780 acctcagctc ttgcacaccc gagccgcccg ccgagcacta ccaggccattctggaggaaa 840 acaaggagaa gctgccactt gacatcaacc ccgtggtgca cccccacggccacatcttca 900 aggtctacac ggacgtgcag caggtggcca gcagcctcac ccacccgcgcttcaccctca 960 cccagagtga ggcggacgcc gacatcctct tcaacttctc acacttcaaggactacagga 1020 aactcagcca ggagaggcca ggcgtgctgc tgaaccagtt cccctgcgagaacctgctga 1080 ctgtcaagga ctgcctggcc tccatcgcgc gccgggcagg tggccccgagggcccaccct 1140 ggctgccccg aaccttcaac ctgcgcactg agctgcccca gtttgtcagctacttccagc 1200 agcgggaaag gtggggcgag gacaaccact ggatctgcaa gccctggaacctggcgcgca 1260 gcctggacac ccacgtcacc aagagcctgc acagcatcat ccggcaccgagagagcaccc 1320 ccaaggttgt gtccaagtac atcgaaagtc ccgtgttgtt ccttcgagaagacgtgggaa 1380 aggtcaagtt cgacatccgc tacatcgtgc tgctgcggtc agtgaggcccctacggttgt 1440 tcgtgtatga tgtgttctgg ctgcggttct ccaaccgggc ctttgcactcaacgacctgg 1500 atgactacga gaagcacttc acggtcatga actatgaccc ggatgtggtgctgaagcagg 1560 tgcactgtga agagttcatc cccgagtttg agaagcaata cccagaatttccctggacgg 1620 acgtccaggc tgagatcttc cgggccttca cggagctgtt ccaggtggcctgtgccaagc 1680 caccacccct gggcctctgc gactacccct catcccgggc catgtatgccgtcgacctca 1740 tgctgaagtg ggacaacggc ccagatggaa ggcgggtgat gcagccgcagatcctggagg 1800 tgaacttcaa ccccgactgt gagcgagcct gcaggtacca ccccaccttcttcaacgacg 1860 tcttcagcac cttgtttctg gaccagcccg gtggctgcca cgttacctgccttgtctagg 1920 cactcgctgt ccccaaaacc tgtgcttggg gcaggattcc aacctcagttctctgagctg 1980 cttctgcaaa ggcccccatg tccctcccca caccggccct gggcatagcctcagccccag 2040 gcctctgtcc tgccgagcca tcctcccggc gccacactcc gggagcacagcatcctcctc 2100 tcacctgtgg gtcagagcag gacagtgatg gtgtccccag ggctgagcaccaccccacgc 2160 cctgccctca cccctcacca ccatctgtgc actgatgagt ctccagtttagccaagggct 2220 tcgttcctgg catggagaat ttgttcctgg ctgctgtgtt tccagggggtgctgggggaa 2280 gggttccgtg gagcgagaca aggtgtcctc gggagcaggg ttccaccgggaagcgtttgg 2340 gagccctgta tcacacgggg caggcgggtt tctcttccgg ggtctctgctcttatgcatc 2400 aggacgaccc cgggacggct gtggggcccc acactgcacc cacagggctctatgcgacag 2460 gggcccagga acagcctgag gccaccaccc agcaagcccg ccttatcacccattccagct 2520 cacccagaac cttcaccagc aaacctcctg ctgaggtcct ggcaggaggccaccgtcttg 2580 ttaccgtttc cttttcgttt gctgagggtc acagacccca acagggaaatcagtatctgt 2640 cttcccagtg gttgccctgc tcgccgggca ctccacgggg tcccgcccttgtgtgagatg 2700 ggccaggatc cttcggcaag gggcgcctgg ggctggggct gattgtgggcggtggagcgc 2760 cagacagaaa aggattccaa tgagccccag ccccaggcgc cccttgccgaaggatcctgg 2820 ggctggggct gattgtgggc ggtggagcgc cagacagaaa aggattccaatgagaacttc 2880 aggttaaagt cagatgccac ctaccagggt ctacagtcaa aatgttggctttttcttatt 2940 ttttaatgta tgggagaaaa atgtaaaatt ccagttcttt tctaattgtgtttctgaaat 3000 taggagtcag ctgccagcgt ttttgtgtgg ctgcagtgtg cctgggcccagctcacgggc 3060 agtgggtgga cctaactgcc caggcaggcg agagctactt ccagagccttccagtgcatg 3120 ggagggcagg gctaggtgta gcggtgtctc ctctttgaaa ttaagaactatctttcttgt 3180 agcaaagctg cacctgatga tgctgcctct cctctctgtg ttgtctgggcccttgtttac 3240 aagcacgcgt tacccttcct gaggggagcc atgctctagc ccctggagggcctgttgcag 3300 gggcagggcg ggcccgtcgc ctttggcagc tcctggagag ctgtggacatgcagtccccc 3360 tcagttcgtg ctgcaataaa ggccatcttc tctt 3394 57 1526 DNAHomo sapiens 57 gttttttttt ttttttttaa ttgcaagcat atttctttta atgactccagtaaaattaag 60 catcaagtaa acaagtggaa agtgacctac acttttaact tgtctcactagtgcctaaat 120 gtagtaaagg ctgcttaagt tttgtatgta gttggatttt ttggagtccgaaggtatcca 180 tctgcagaaa ttgaggccca aattgaattt ggattcaagt ggattctaaatactttgctt 240 atcttgaaga gagaagcttc ataaggaata aacaagttga atagagaaaacactgattga 300 taataggcat tttagtggtc tttttaatgt tttctgctgt gaaacatttcaagatttatt 360 gatttttttt tttcactttc cccatcacac tcacacgcac gctcacactttttatttgcc 420 ataatgaacc gtccagcccc tgtggagatc tcctatgaga acatgcgttttctgataact 480 cacaacccta ccaatgctac tctcaacaag ttcacagagg aacttaagaagtatggagtg 540 acgactttgg ttcgagtttg tgatgctaca tatgataaag ctccagttgaaaaagaagga 600 atccacgttc tagattggcc atttgatgat ggagctccac cccctaatcagatagtagat 660 gattggttaa acctgttaaa aaccaaattt cgtgaagagc caggttgctgtgttgcagtg 720 cattgtgttg caggattggg aagggcacct gtgctggttg cacttgctttgattgaatgt 780 ggaatgaagt acgaagatgc agttcagttt ataagacaaa aaagaaggggagcgttcaat 840 tccaaacagc tgctttattt ggagaaatac cgacctaaga tgcgattacgcttcagagat 900 accaatgggc attgctgtgt tcagtagaag gaaatgtaaa cgaaggctgacttgattgtg 960 ccatttagag ggaactcttg gtacctggaa atgtgaatct ggaatattacctgtgtcatc 1020 aaagtagtga tggattcagt actcctcaac cactctccta atgattggaacaaaagcaaa 1080 caaaaaagaa atctctctat aaaatgaata aaatgtttaa gaaaagagaaagagaaaagg 1140 aattaattca gtgaaggatg attttgctcc tagttttgga gtttgaatttctgccaggat 1200 tgaattattt tgaaatctcc tgtcttttta aactttttca aaataggtctctaaggaaaa 1260 ccagcagaac attagcctgt gcaaaaccat ctgtttgggg agcacactcttccattatgc 1320 ttggcacata gatctccctg tggtgggatt ttttttttcc ctttttttgtgggggagggt 1380 tggtggtata tttttcccct cttttttcct tcctctccta catctcccttttcccccgat 1440 ccaagttgta gatggaatag aagcccttgt tgctgtagat gtgcgtgcagtctggcagcc 1500 ttaagcccac ctgggcactt ttagat 1526 58 8213 DNA Homosapiens 58 cccccagcag aagggcgcga cggctgcaac atcagcggtt aaattgtacagcctttcata 60 ggccggttca atgcatccgt actaagattg ttaaggctga gggtccctagcctggggaaa 120 aacgaaagga ggcagagggt agggagacgg gaaggaagac aaggagggtgtagaaaacgg 180 ggagaggagg gggcgggaca gcatggggaa ggcctcaggt ttactggagagatcgtggcg 240 ttcccataga aacgtatccc tccgcccatg acccgcgtgt tagtctcttcagttccttcc 300 gcgtcgtttc ttggctgttt ccgcccagct cctttgtgcc gcgcagaacaacgagatgac 360 gcatgcgcaa agcgcagcgg ccgcatatat aaacgcgaac ccgggctcttcctcgtagtg 420 ccgccgggac tcttggcggg tgaaggtgtg tgtcagcttt tgcgtcactcgagccctggg 480 cgctgcttgc taaagagccg agcacgcggg tctgtcatca tgtcgcgttacgggcggtac 540 ggaggaggta agaagctgga gtccggtgag ggacgttggt gtgggtgtagtgagcactgc 600 gaggccgtag ggttgtcgcg gaggttggga gacggttatt ccgcgtgcgtaatggcggct 660 taggagcacg ccagacgaag ccggaggcag cggaggcggg gtgctgaagggagacgggat 720 ggcgggtgta catctctgcc gagttccgta ctcttgggca tttttgtggcccaatccagc 780 ctaaagcagg gttgagatga cggttttcgc gttgcctttc tcggagctgcccgccggccc 840 ccctcccccc ccgccctcgg ccggcggctg ccattttgcg cacattgaggaccgtggtgg 900 cgcatttcct cagcgctttc ccgccacttc agcggacaga tctggccgcagctgtaagat 960 cgtggttgtg tttgagatag aacgaaattg gcagctgtga gctgcatgttctcgtcaaac 1020 aatcggttaa attgcggaat gggaatgggg acgtaatctg cgactggcggctgggttttt 1080 ttttagttat ttccagcgcg gtttatggct ctggggcggg gagctggagtcttgggcgag 1140 cctgtgcctg ggacgtttgc cgcggaggac gagagccggc gcagccctgctctcctggcc 1200 cggcccctac cgaggccctc ccgccgccga cgcgctgccg ctgcgggcccgcgcgctccc 1260 ggtgcgcccg gggctgccgg gactcatggg tggggccggg ccaggtcccgccccacgcct 1320 cggtgtatcc taccacgcgt ttctgcttgt gttcgggagg gtcaccccgcattatttaga 1380 acgttaagaa ttttgtcaaa agtctagttt ctcggggatt tgcggacttcaccagtttta 1440 cgactaagtt ttgtcttgga tagagggcat taaatgtgct ttacccaatcttgaggatgg 1500 cccgttttaa ggcaagtaag taattgaaac ttgggccaga ttttgcataacgtgcattct 1560 tctatttgcg tttttaaaca gaaaccaagg tgtatgttgg taacctgggaactggcgctg 1620 gcaaaggaga gttagaaagg gctttcagtt attatggtcc tttaagaactgtatggattg 1680 cgagaaatcc tccaggattt gcctttgtgg aattcgaaga tcctagagatgcagaagatg 1740 cagtacgagg actggatgga aagtaagtaa gatgttatga atcttctgttcattaaaata 1800 tactgtggct agataatgaa cttagtgcta aatttggatt ctgaagtctggaagagacct 1860 taaatagctg gtcatagtgt taaatgctaa aggcacacga aggttaaagaagatagcgga 1920 gatggagtta gggcttggta aagaccgcca aagtttgttg ggggggaaggagtggttgga 1980 aagagtgagt ggttggaaag agttcttttt aaatctataa gtcctgaatatatttttaac 2040 tttagaattt tgttaatttg cttttattag ggtgatttgt ggctcccgagtgagggttga 2100 actatcgaca ggcatgcctc ggagatcacg ttttgataga ccacctgcccgacgtccctt 2160 tgatccaaat gatagatgct atgagtgtgg cgaaaaggga cattatgcttatgattgtca 2220 tcgttacagc cggcgaagaa gaagcaggta tttattttaa taaaggaatggttggtattc 2280 tagttaatca agtaattctt ttattagcaa ggcagaaact agtgtttttctataaacttg 2340 aatgttaatt gtacaggtgt attttacaat ttgtgtttaa ttaaaaaaatgttactatat 2400 taataatcaa cctggtcaaa acctttcagg tttcttcgtt tgagtcagtcgccttgattc 2460 agaatgtcac gagccttatg atatcatgct gaggcgcctt gcaaatccgacaattaagat 2520 cctcctagac cttgaggtga tcagcataag aggccagatc ccctcgagtcatctacacct 2580 agcttcacct tattctttaa agggcagaaa atttgagacg gtgatcgccgtaacagtaaa 2640 tttggcttac aattggggcc cccctccggt ttagaaagag gaacaccagattgaccacat 2700 tcccaactag aaaaatcttc ttgcgtcaat caagcctcac ctggctcatttggctgtcag 2760 tttgatcgtc gttagattga agaaaacatc tagatgcagc gatcggctatagatacttct 2820 agatcgtcta gatctactag accatgggcc aaagagggtc gacctgcaaacttgcaaggt 2880 ttatgttaaa tacacattac agtgttttat attatgtaat gctaagttgtaattcagctt 2940 ttaacaaatc tttttttagg tagtaaaaaa aaaaatactc aacaactaataggcccagag 3000 tttatttcca aatgagacac taaatttaaa tagttttgag atttgatttcagcagaggca 3060 cacaaactct taaaaacgag ttattgtctg acattttgtt ttttctctaacttgaaaaat 3120 aggtcacggt ctagatcaca ttctcgatcc agaggaaggc gatactctcgctcacgcagc 3180 aggagcaggg gacgaaggtg agatcttgtt taactgaagt ctttctgtattattattaaa 3240 ttcactggta gtccaacaca gaaaaagctc attatttttt ttggagacagggtcttgctc 3300 tgtcacccgg gctggagtac aggggcataa ccacgactca ctgctgccttgatgatctct 3360 tgggtttaag cagttctcct acctcagcct cccgagtagc tgggactgtaggcactgcca 3420 ccatacccag ctaattttta tttttgtaga aatggtcttg cactgtttcccaggctggtc 3480 tcaagctcct gggctcaaac gatcctcccg cagtgctggg attatgggcatgagccactg 3540 caccgttccc cagttgaagt cttaacaggc caaaaaaaaa aaaaactgtggagatggact 3600 taaagttctt tattttaggt caaggtcagc atctcctcga cgatcaagatctatctctct 3660 tcgtagatca agatcagctt cactcagaag atctaggtct ggttctataaaaggatcgag 3720 gtatttccag tatgtaacac tttttttcct tacttgtgtt tggattgttcacatcttatc 3780 agtagagtgt cttaaggaca taattcaaat ggattgcttc agggaatatttgagatgtaa 3840 aagtttggaa tttatgtgta acttgtaaca taaatattac cctagtttcacagatgaaga 3900 aaagggctac tagagatttt aaggcttgtt aggccgtgtg gtagacaagggtcccaagca 3960 atacagctct actcaacact ctgggtaggc atgttgctat aaacttttctggcttcagat 4020 tggatgatac tagctctgaa agatggtaat tgattttccc gacaaaaaggcctattagca 4080 ccaggaaaag agatcagaag caagtagaaa catttctcat ttttggaatgatggggttga 4140 tttgagacac tggaaagttg actagggcag tagtgtgtac acagaaatgaatgtggattt 4200 tttttttaga ccgtttcaga cctgaaaaaa ctaaagaacc agagctttactatttgtaga 4260 aggccttaaa aggagataga atggaaaaaa ttgtaaaata agtattgcaacatgtaatta 4320 acaatattgt tatctgtacc aacgataaaa ccgtggtacg gaatgctactgggagttaaa 4380 ttgctgttta atagcacaaa acctttaaat gcaggaattc tgaatcttgtggtctatttg 4440 agaaagctat gaaccatctc tttagataaa tttaaaagat agatatgtcagtctgatttg 4500 gtttgtctga cagattgatg gctctcaaac ataacttgat ccgggaagaagcctgacaaa 4560 tggggggcgg ctttcttttc gtctggcctt atcacctgaa ttagtctcagttcaggggtc 4620 tggttatttt catcctgcct tagcctcctg agtagctggg actgccattgtgtaccacag 4680 tgcccagctg agggatctgt gccttaagtg aggttagttt tgcttccttcataccagtct 4740 catcaaatga aaaccatgta tttcccttgg atattacaca gtgtttgagaatgttatacc 4800 tgtacagaaa ctaaccaatt gagtgataga aacaagtaat tgaaatgggggttccttatg 4860 tctggtaaca ctttgtttga cagtgtgtta gacagaataa ggcaagtgttgcatcttgtt 4920 tagttttagc ttctttatgc ctgaccaacc taatacagtg ttgagtagttaaggaaattc 4980 ctttggactg attgatataa ttgtgttttt tcactttttt tattaagatccccgtcgagg 5040 tcaagatcaa gatccaggtc tatttcacga ccaagaagca ggtagggtaaaaatttgatt 5100 atccttttct agttatatgg caccaatatc caaagagttc aaagtgtttttaattgttga 5160 aattttaagt gttaactcta aacttaggtt ttagtgggaa cacagtaccttatttgtgta 5220 tgtcctattt attactggct gactttccct gaacaaggga atgtaaaactatagtgagaa 5280 agaagcttat gacttggggg attatattaa agaggccctt gttagaactgataggtgcat 5340 ggagaagcat cctgaaatcg atgtgcttaa agcagaatgt aaaagattaatcatgatgta 5400 gtaattgagt cattttttga aaaacagttg ttgaaagatt ggcttttgttagcaacaact 5460 ggtaggatgt ttttcagttt aagtgcagtc tgacatttta agcttaggacatttgggggt 5520 tttacggtat tggtgactac aagaaaggga ttggttagta ctctttctttaatagaattt 5580 ctcatgtttt gacagccgat caaagtccag atctccatct ccaaaaagaaggtaagctaa 5640 atgttttgtt gccaaatctt gcctgtcaag tgtggcctct gcagaatttgtttgcttact 5700 gctttgcagt ctttgagctc tttggagaat tggtgctata tagattaaaatactatgcta 5760 agtttctgaa atactttttt tttttgattc agtaacatta gtttatacttttgctggaaa 5820 tacttagtca taaaatgtta gggtgattat taagatgtga ttggtcctgtgagtacttgg 5880 tagaaatttt ggtaagatag atgccttttc cccacatgta caatagatacaaagtgtgga 5940 gaaaagtctt ggaaatagtt acctgcctag tgcttcttta tgaccagaaaacttcaaata 6000 gttgtcatat ttatctagtg cttcttaatg accagaagac ttcaaatagttgtcatattt 6060 aactgcaggt tgaccttgca attttgacaa ggaggatagc ctaatttttttttttttctg 6120 ggatggagtt ttcgctctgt ccccaggctt ggagtgcagt ggctcaatcttggctcactg 6180 cagcctccga ttcccgggtt caagcaatta tcctgtctca gcctcttgagcagttgggat 6240 tacaggcacc caccgccaag cctggctaat tttttgtatt tctagtagagacggagtttc 6300 accatgttgg cgaggttggt cttaaactcc tgatcttagg tgatcacctgcctcggcctc 6360 tcccaaagtg ctggggttac aggcgtgagc caccgtgcct ggccagggtagcctaatctt 6420 aagccaggga caaaagatga atatatgtaa gtttcatgtc atttttaggtctttgctata 6480 ggaaattagt accttaggcc acctttgaag ttattgaaag ttagtacatgtacatgagag 6540 tttcaattga cactaattgg atccaaacct aatgtttttc tttttagtcgttccccatca 6600 ggaagtcctc gcagaagtgc aagtcctgaa agaatggact gaagctctcaagttcaccct 6660 ttagggaaaa gttattttgt ttacattatt ataagggatt tgtgatgtctgtaaagtgta 6720 acctaggaaa gataattcaa ccatctaatc aaaatggatc tggattactatgtaaattca 6780 cagcagtaag gataatataa attttgttga atgtatgaac atcatatggtctgaaaatgt 6840 gggtttttat ttggcacatt taaataacat gtttctaact agatttttgatttgtgttca 6900 atattaacac ttcttaattt gatatatttg agagtcagac attataattgttaatcctta 6960 ttcatacata cctacattca gaattgaaag gtgttggtta agtcttgaacatcactattc 7020 tatgcataaa acttggccag gatcttaagg gactttgaaa attccatcttacccttgtag 7080 ctctgggtaa gatgacctga gtcccttatg atacagcctg aatgcatcatgacagatcct 7140 tagttagcta atccgtttga agttggtgtt agtaggtatt gtatgatcagtggtgaagca 7200 agtaggacca ctgatgtgtc taaatgagca tgacaggaac taaacgaaactgattaaatg 7260 tatgagaaat agaaactgat ttctggatga tctttatact aattgcagctttcaggctac 7320 taggtggcat agtgttaatt aggactcccc aagatatggg gagttctactctcaatggtc 7380 ttgtttcttt gctttctaca ttagttaacc agttttatac caaaaaatgcatgtttgagg 7440 aattgtctga aattgggaca aaacaccttc atgtaaacca gctttgcaaaattttccagc 7500 ccagatactc ttcatctatt caaatggatt gtcttattct gagcaaagacctgttgttaa 7560 tcttcaagct aggttttgca gttcccaacc acaacattct tctattttgccaggctggtg 7620 caaagtaatt aaagatgtca atcagaaatg tcaatgagac taaagtggttttgtaaatct 7680 cagctatatt tagcaacact ccatgtagct aatatttttt ggtagcatctggtagacctt 7740 agaatgttac atagccagta ggttctttat tcaaatttta agtatcttaagaatagtagg 7800 gcagtaacag ttacttttga gagttttctg gtcaagcttt taccaggcattctctagcct 7860 tggtacaaaa aaaaaaaaaa cctgctggtt gcgcagatac ctaggcttgtccattttatg 7920 catttcagca aagtcattgg agactattgc aacttgggaa tactggtctgcatcaagttt 7980 aattcggtag tttgaccgct agtatgttgg aagttatttg gattgtttttggaattttga 8040 ctggctgaat tatggttggt ataaagttat gtgtataact ggcaggcttatttatctgtt 8100 gcacttggtt agctttaatt gttctgtatt atttaaagat aagtttactcaacaataaat 8160 ctgcagagat tgaacaaata atcctgatac ttaatttttg gaagtgggagctc 8213 59 2042 DNA Homo sapiens 59 gcgcctgtca gggaagcggc gcgcgcgcgcgggcggcggg cgggctgggg atccgccgcg 60 cagtgccagc gccagcgcca gacccgcgccccgcgctctc cggcccgtcg cctgccttgg 120 gactcgcgag cccgcactcc cgccctgcctgttcgctgcc cgagtatgga gctgctgtgt 180 tgcgaaggca cccggcacgc gccccgggccgggccggacc cgcggctgct gggggaccag 240 cgtgtcctgc agagcctgct ccgcctggaggagcgctacg taccccgcgc ctcctacttc 300 cagtgcgtgc agcgggagat caagccgcacatgcggaaga tgctggctta ctggatgctg 360 gaggtatgtg aggagcagcg ctgtgaggaggaagtcttcc ccctggccat gaactacctg 420 gatcgctacc tgtcttgcgt ccccacccgaaaggcgcagt tgcagctcct gggtgcggtc 480 tgcatgctgc tggcctccaa gctgcgcgagaccacgcccc tgaccatcga aaaactgtgc 540 atctacaccg accacgctgt ctctccccgccagttgcggg actgggaggt gctggtccta 600 gggaagctca agtgggacct ggctgctgtgattgcacatg atttcctggc cttcattctg 660 caccggctct ctctgccccg tgaccgacaggccttggtca aaaagcatgc ccagaccttt 720 ttggccctct gtgctacaga ttatacctttgccatgtacc cgccatccat gatcgccacg 780 ggcagcattg gggctgcagt gcaaggcctgggtgcctgct ccatgtccgg ggatgagctc 840 acagagctgc tggcagggat cactggcactgaagtggact gcctgcgggc ctgtcaggag 900 cagatcgaag ctgcactcag ggagagcctcagggaagcct ctcagaccag ctccagccca 960 gcgcccaaag ccccccgggg ctccagcagccaagggccca gccagaccag cactcctaca 1020 gatgtcacag ccatacacct gtagccctggagaggccctc tggagtggcc actaagcaga 1080 ggaggggccg ctgccaccca cctccctgcctccaggaacc acaccacatc taagcctgaa 1140 ggggcgtctg ttcccccttc acaaagcccaagggatctgg tcctacccat ccccgcagtg 1200 tgcactaagg ggcccggcca gccatgtctgcatttcggtg gctagtcaag ctcctcctcc 1260 ctgcatctga ccagcagcgc ctttcccaactctagctggg ggtgggccag gctgatggga 1320 cagaattgga tacatacacc agcattccttttgaacgccc cccccccacc cctgggggct 1380 ctcatgtttt caactgccaa aatgctctagtgccttctaa aggtgttgtc ccttctaggg 1440 ttattgcatt tggattgggg tccctctaaaatttaatgca tgatagacac atatgagggg 1500 gaatagtcta gatggctcct ctcagtactttggaggcccc tatgtagtcc gtgctgacag 1560 ctgctcctag agggaggggc ctaggcctcagccagagaag ctataaattc ctctttgctt 1620 tgctttctgc tcagcttctc ctgtgtgattgacagctttg ctgctgaagg ctcattttaa 1680 tttattaatt gctttgagca caactttaagaggacataat gggggcctgg ccatccacaa 1740 gtggtggtaa ccctggtggt tgctgttttcctcccttctg ctactggcaa aaggatcttt 1800 gtggccaagg agctgctata gcctggggtggggtcatgcc ctcctctccc attgtccctc 1860 tgccccatcc tccagcaggg aaaatgcagcagggatgccc tggaggtggc tgagcccctg 1920 tctagagagg gaggcaagcc ctgttgacacaggtctttcc taaggctgca aggtttaggc 1980 tggtggccca ggaccatcat cctactgtaataaagatgat tgtgaaataa aactggcttt 2040 gg 2042 60 1783 DNA Homo sapiens60 cctctcggag ctggaaatgc agctattgag atcttcgaat gctgcggagc tggaggcgga 60ggcagctggg gaggtccgag cgatgtgacc aggccgccat cgctcgtctc ttcctctctc 120ctgccgcctc ctgtgtcgaa aataactttt ttagtctaaa gaaagaaaga caaaagtagt 180cgtccgcccc tcacgccctc tcttcctctc agccttccgc ccggtgagga agcccggggt 240ggctgctccg ccgtcggggc cgcgccgccg agccccagcg ccccgggccg cccccgcacg 300ccgcccccat gcatcccttc tacacccggg ccgccaccat gataggcgag atcgccgccg 360ccgtgtcctt catctccaag tttctccgca ccaaggggct gacgagcgag cgacagctgc 420agaccttcag ccagagcctg caggagctgc tggcagaaca ttataaacat cactggttcc 480cagaaaagcc atgcaaggga tcgggttacc gttgtattcg catcaaccat aaaatggatc 540ctctgattgg acaggcagca cagcggattg gactgagcag tcaggagctg ttcaggcttc 600tcccaagtga actcacactc tgggttgacc cctatgaagt gtcctacaga attggagagg 660atggctccat ctgtgtgctg tatgaagcct caccagcagg aggtagcact caaaacagca 720ccaacgtgca aatggtagac agccgaatca gctgtaagga ggaacttctc ttgggcagaa 780cgagcccttc caaaaactac aatatgatga ctgtatcagg ttaagatata gtctgtggat 840ggatcatctg atgatgatcc ataaatttga tttttgcttt gggtgggctc ctcttgggga 900tggattatgg aatttaaacc atgtcacagc tgtgaagatc tggcacaaga tagaatggta 960aaaaaaaaaa aaaattttaa gtgacagtgc catagtttgg acagtacctt tcaatgatta 1020attttaatag cctgtgagtc caagtaaatg atcactttat ttgctaggga gggaagtcct 1080agggtggttt cagtttctcc cagacatacc taaattttta catcaatcct tttaaagaaa 1140atctgtattt caaagaatct ttctctgcag taaatctcgc aggggaattt gcactattac 1200acttgaaagt tgttattgtt aaccttttcg gcagctttta ataggaaagt taaacgtttt 1260aaacatggta gtactggaaa ttttacaaga cttttaccta gcacttaaat atgtataaat 1320gtacataaag acaaactagt aagcatgacc tggggaaatg gtcagacctt gtattgtgtt 1380tttggccttg aaagtagcaa gtgaccagaa tctgccatgg caacaggctt taaaaaagac 1440ccttaaaaag acactgtctc aactgtggtg ttagcaccag ccagctctct gtacatttgc 1500tagcttgtag ttttctaaga ctgagtaaac ttcttatttt tagaaagtgg aggtctggtt 1560tgtaactttc cttgtactta attgggtaaa agtcttttcc acaaaccacc atctattttg 1620tgaactttgt tagtcatctt ttatttggta aattatgaac tggtgtaaat ttgtacagtt 1680catgtatatt gattgtggca aagttgtaca gatttctata ttttggatga gaaatttttc 1740ttctctctat aataaatcgt ttcttatctt ggcattttta acc 1783 61 1433 DNA Homosapiens 61 ttggacagcc cgggcaacct cgacaccctg caggcgaaaa agaacttctccgtcagtcac 60 ctgctagacc tggaggaagc cggggacatg gtggcggcac aggcggatgagaacgtgggc 120 gaggctggcc ggagcctgct ggagtcgccg ggactcacca gcggcagcgacaccccgcag 180 caggacaatg accagctgaa ctcagaagaa aaaaagaaga gaaagcagcgaaggaatagg 240 acaaccttca atagcagcca gctgcaggct ttggagcgtg tctttgagcggacacactat 300 cctgatgctt ttgtgcgaga agaccttgcc cgccgggtga acctcaccgaggcgagagtg 360 caggtgtggt ttcagaaccg aagagccaag ttccgcagga atgagagagccatgctagcc 420 aataaaaacg cttccctcct caaatcctac tcaggagacg tgactgctgtggagcagccc 480 atcgtacctc gtcctgctcc gagacccacc gattatctct cctgggggacagcgtctccg 540 tacagatcct cgtccctccc aagatgttgt ttacacgagg ggcttcataacggattctaa 600 cggaagacac tgaaaagcgc catggctact tattctgcca catgtgccaacaatagccct 660 gcacagggca tcaacatggc caacagcatt gccaacctga gactgaaggccaaggaatat 720 agtttacaga ggaaccaggt gccaacagtc aactgaggaa aaaaaataattaaacaggcc 780 taagaagaaa tcaaaaacca taagacacct atcctgctct gttatttcttcatctgctgg 840 ggggaaaaag taaattacaa acaaacaaac aaagcagaac taaaatattgggaccatggc 900 agagaaaagc aggagaggag caaaatgaaa attagttaac aaatgttcctcctcctctgg 960 gataccacca ccacttgttt ctgtgtgtgt ttattttgtt tttctttcattcatgctttg 1020 cttaatgtac tccaggcttc ttcagctagg ttcagcccac ccacccccatgcttgtaatc 1080 ccagtgcttt gggaggccaa ggcaggtgga tcacctgagg tcaggagttcgagactagcc 1140 tgttccactg acatttctta gacattcagc aaaaccccca ccttaacctcttttctttct 1200 tgagggttgg tcctgtcccc acctccaccc tcccaccccc tggaagaggaagggcccggg 1260 catcagtggc tagtccaaat aaaatatggg cttggggatg gaatgggtggtggtaagttc 1320 acagagtgta gttagatccc aactcccatg acctctggct tcagtggtgggtggggcagg 1380 gcagatgaaa gggcttcagt gggaacctct gagagcattt tcctgttcccaat 1433 62 643 DNA Homo sapiens 62 ggtagcgacg gtagctctag ccgggcctgagctgtgctag cacctccccc aggagaccgt 60 tgcagtcggc cagccccctt ctccacggtaaccatgtgcg accgaaaggc cgtgatcaaa 120 aatgcggaca tgtcggaaga gatgcaacaggactcggtgg agtgcgctac tcaggcgctg 180 gagaaataca acatagagaa ggacattgcggctcatatca agaaggaatt tgacaagaag 240 tacaatccca cctggcattg catcgtggggaggaacttcg gtagttatgt gacacatgaa 300 accaaacact tcatctactt ctacctgggccaagtggcca ttcttctgtt caaatctggt 360 taaaagcatg gactgtgcca cacacccagtgatccatcca gaaacaagga ctgcagccta 420 aattccaaat accagagact gaaattttcagccttgctaa gggaacatct cgatgtttga 480 acctttgttg tgttttgtac agggcattctctgtactagt ttgtcgtggt tataaaacaa 540 ttagcagaat agcctacatt tgtatttattttctattcca tacttctgcc cacgttgttt 600 tctctcaaaa tccattcctt taaaaaataaatctgatgca ccg 643 63 4792 DNA Homo sapiens 63 ctcaaatatg tggatgacatacagaaggga aataccatca aaagactgaa catccagaag 60 aggcggaagc cgtccgtgccatgcccagaa cccaggacca catctggtca gcaaggtata 120 tggacttcca ctgaatccctctcatcctcc aacagtgatg acaacaagca gtgccccaac 180 ttcctcatag ccagaagtcaagttacatca actccaatct caaagccacc tccccctctg 240 gagacctcac tcccttttcttaccatccca gaaaatcgac agctgccacc tccctcacca 300 caactcccaa agcataaccttcatgtcacc aagacactga tggagacccg gagaagactg 360 gaacaggaga gagccaccatgcagatgaca ccgggtgagt tcagaaggcc caggctggcc 420 agttttggag gcatgggcaccacaagctcc ctcccttctt ttgtgggttc tggaaaccac 480 aatcctgcca agcaccagcttcagaatgga taccaaggta atggggatta tggtagctat 540 gccccagctg ctcccaccacttcctccatg gggagctcca tccgccacag ccccctgagc 600 tcagggatct ccaccccagtgaccaacgtg agccccatgc acctgcagca catccgcgag 660 cagatggcca ttgctctgaaacgcctgaag gagctggagg agcaggtgcg aaccatccct 720 gtgctccagg taaagatctctgtcttgcaa gaagagaaaa ggcagttggt ctcacagctg 780 aaaaaccaaa gggctgcatcccagatcaat gtctgtggtg tgaggaagcg gtcctatagt 840 gcggggaacg cctcccagctggaacagctc tcccgggccc gaagaagtgg cggggaatta 900 tacattgact atgaggaggaagaaatggag accgtagaac agagcacgca gaggataaag 960 gagttccggc aacttacagcagacatgcaa gccctggagc agaagatcca ggacagcagc 1020 tgtgaggcct cctcagagctcagggagaat ggagagtgcc ggtctgtggc tgtgggtgcc 1080 gaggagaaca tgaacgacatcgtcgtgtac cacagaggct ccaggtcctg taaggatgca 1140 gctgtaggga cacttgttgagatgagaaat tgtggggtca gcgtgacaga ggccatgctt 1200 ggagtgatga ctgaagctgacaaagaaatt gagctccaac agcagaccat agaagccttg 1260 aaggaaaaga tctatcgcctagaagtacag cttagagaaa ccacccatga ccgggagatg 1320 actaaactga aacaagagctgcaggctgct ggatcgagga aaaaggttga caaagccacg 1380 atggcccagc cgcttgttttcagtaaggtg gtggaggcag tggtgcagac cagagaccaa 1440 atggtcggca gtcacatggacctggtggac acgtgtgttg ggacctccgt ggaaacaaac 1500 agtgtaggca tctcctgccagcctgaatgt aagaataaag tcgtagggcc tgagctgcct 1560 atgaattggt ggattgttaaggagagggtg gaaatgcatg accgatgtgc tgggaggtct 1620 gtggaaatgt gtgacaagagtgtgagtgtg gaagtcagcg tctgcgaaac aggcagcaac 1680 acagaggagt ctgtgaacgacctcacactc ctcaagacaa acttgaatct caaagaagtg 1740 cggtctatcg gttgtggagattgttctgtt gacgtgaccg tctgctctcc aaaggagtgc 1800 gcctcccggg gcgtgaacactgaggctgtt agccaggtgg aagctgccgt catggcagtg 1860 cctcgtactg cagaccaggacactagcaca gatttggaac aggtgcacca gttcaccaac 1920 accgagacgg ccaccctcatagagtcctgc accaacactt gtctaagcac tttggacaag 1980 cagaccagca cccagactgtggagacgcgg acagtagctg taggagaagg ccgtgtcaag 2040 gacatcaact cctccaccaagacgcggtcc attggtgttg gaacgttgct ttctggccat 2100 tctgggtttg acaggccatcagctgtgaag accaaagagt caggtgtggg gcagataaat 2160 attaacgaca actatctggttggtctcaaa atgaggacta tagcttgtgg gccaccacag 2220 ttgactgtgg ggctgacagccagcagaagg agcgtggggg ttggggatga ccctgtaggg 2280 gaatctctgg agaacccccagcctcaagct ccacttggaa tgatgactgg cctggatcac 2340 tacattgagc gtatccagaagctgctggca gaacagcaga cactgctggc tgagaactac 2400 agtgaactgg cagaagctttcggggaacct cactcacaga tgggctccct caactctcag 2460 ctcatcagca ccctgtcgtctatcaactct gtcatgaaat ctgcaagcac tgaagagctg 2520 aggaaccctg acttccagaaaaccagtctg ggtaaaatca caggcaatta tttgggatat 2580 acctgtaagt gtgggggccttcagtcagga agtcccttaa gctcccagac atcccagcct 2640 gagcaagaag tggggacctcagaaggaaag ccaatcagca gcctggatgc cttccccact 2700 caggaaggta cgctgtctccagtgaacctg acagacgacc agatcgccgc tggcctctat 2760 gcatgtacaa acaatgaaagtacactgaag tccatcatga agaagaaaga tggtaacaaa 2820 gattcaaatg gcgcaaaaaagaatcttcag tttgttggca ttaatggagg gtatgaaaca 2880 acttcaagtg atgattccagctcagatgaa agctcttctt ccgagtcaga tgacgagtgt 2940 gatgtcattg agtatcctcttgaagaagag gaggaggagg aggatgaaga cactcgggga 3000 atggcagaag ggcaccatgcagttaatatt gaaggtttga agtctgccag ggtggaagat 3060 gaaatgcagg ttcaagaatgtgaacctgag aaggtggaaa tcagagagag gtatgaatta 3120 agtgaaaaga tgttgtctgcatgcaactta ctgaaaaata ctataaatga ccccaaagct 3180 ttgaccagca aagatatgaggttctgtctg aacaccctcc agcacgagtg gttccgcgtg 3240 tccagtcaga agtcagccattccagccatg gtgggggact acatagctgc ttttgaggcc 3300 atttccccag atgtcctccgctatgtcatc aacttggcag acggcaacgg caacacagcc 3360 ctccattaca gcgtgtcccactccaacttc gagattgtga agctgctgtt agatgccgat 3420 gtgtgtaatg tggatcaccagaacaaggca ggctacaccc ccatcatgtt ggcggccctc 3480 gccgctgtgg aagcagagaaggacatgcgg attgtggaag aactctttgg ctgtggggat 3540 gtgaatgcca aagctagtcaggcgggacag acggccctca tgctggcggt cagtcacgga 3600 cggatagaca tggtgaagggccttctggcc tgtggggctg atgtcaacat ccaggatgac 3660 gagggctcca cggccctcatgtgtgccagc gagcacgggc acgtggagat tgtcaagctg 3720 ctgctggccc agcccggctgcaacggtcac ctagaggaca acgatggcag cactgcgctc 3780 tcaatcgccc tggaagcaggacacaaggac atcgctgttc ttctgtatgc ccatgtcaac 3840 tttgcaaaag cccagtctccgggcacccct aggcttggaa ggaagacgtc tcctggcccc 3900 acccaccgag gttcatttgattgattgtat gcaaatagcc ctttatttac atgccactat 3960 taagctgcta attgttcctgttggggtgac agatactgaa tgtatacgta ttgtgcctga 4020 gctcaccagc aaacagaagcatcaagccca ggggtaaagg ctgaagcttt cacagtgcag 4080 agactgctag cctgggcacacgcacctcct ttctggccgt cttctgtgta gggcacactt 4140 taacccagtc tctgttgctgttgagtctct gctccgtttt gtacagtcac agggaattct 4200 gatctgaagg ggcaccttctgttcactccc acaaagtggt gtctggttct cactgagacg 4260 ttttaagatt tttccacaaatatttatatg tactaaatgt ggaaccatta gaaagttctt 4320 ccaaaatctc attccagcatagttttggat ttttcttttg tcttatttta aaataaggaa 4380 gtcgagatga ctttgatcattggtaacttg ggcctgggcc agacaaagta taaaacttac 4440 aaaagaatat tctcatttggtcttaactag gtagatgtaa tatatgactt tttataaaaa 4500 gggtatctat atgaacttgacacagtattt tcagcttttg tattccatac taaagccatg 4560 aagaactaca cgtaacatcatcatttgtat taattgcaca actccaatgc taaaggttgg 4620 attgtgttag aggaatcggctctgtatttg cctctagaga aacacagtgt tctctttgta 4680 tttatggatt cctttttaccgtgtcacatt tactttggtc ctctatgtat ttaaatgttt 4740 gaagtgcctt agactcttgccatattttca aaataaaatt ccattaagct ct 4792 64 2199 DNA Homo sapiens 64gtcgccgctg ccgggttgcc agcggagtcg cgcgtcggga gctacgtagg gcagagaagt 60catggcttct ccgtccaaag gcaatgactt gttttcgccc gacgaggagg gcccagcagt 120ggtggccgga ccaggcccgg ggcctggggg cgccgagggg gccgcggagg agcgccgcgt 180caaggtctcc agcctgccct tcagcgtgga ggcgctcatg tccgacaaga agccgcccaa 240ggaggcgtcc ccgctgccgg ccgaaagcgc ctcggccggg gccaccctgc ggccactgct 300gctgtcgggg cacggcgctc gggaagcgca cagccccggg ccgctggtga agcccttcga 360gaccgcctcg gtcaagtcgg aaaattcaga agatggagcg gcgtggatgc aggaacccgg 420ccgatattcg ccgccgccaa gacatacgag ccctaccacc tgcaccctga ggaaacacaa 480gaccaatcgg aagccgcgca cgccctttac cacatcccag ctcctcgccc tggagcgcaa 540gttccgtcag aaacagtacc tctccattgc agagcgtgca gagttctcca gctctctgaa 600cctcacagag acccaggtca aaatctggtt ccagaaccga agggccaagg cgaaaagact 660gcaggaggca gaactggaaa agctgaaaat ggctgcaaaa cctatgctgc cctccagctt 720cagtctccct ttccccatca gctcgcccct gcaggcagcg tccatatatg gagcatccta 780cccgttccat agacctgtgc ttcccatccc gcctgtggga ctctatgcca cgccagtggg 840atatggcatg taccacctgt cctaaggaag accagatcaa tagactccat gatggatgct 900tgtttcaaag ggtttcctct ccctctccac gaaggcagta ccagccagta ctcctgctct 960gctaaccctg cgtgcaccac cctaagcggc taggctgaca gggccacacg acatagctga 1020aatttcgttc tgtaggcgga ggcaccaagc cctgttttct tggtgtaatc ttccagatgc 1080ccccttttcc tttcacaaag attggctctg atggttttta tgtataaata tatatatata 1140ataaaatata atacattttt atacagcaga cgtaaaaatt caaattattt taaaaggcaa 1200aatttatata catatgtgct ttttttgtat atctcacctt cccaaaagac actgtgtaag 1260tccatttgtt gtattttctt aaagagggag acaaattatt tgcaaaatgt gctaaagtca 1320atgattttta cgggattatt gacttctgct tatggaaaac aaagaaacag acacagtgca 1380cacagaaaat attagatatg gagagattat tcaaagtgaa ggggacacat catatttctg 1440cattttactt gcattaaaag aaacctcttt atatactaca gttgttccta tttttccccc 1500gccccccacc gccccaccac acacatattt ttaaagtttt tcctttttta agaatatttt 1560tgtaagacca atacctggga tgagaagaat cctgagactg cctggaggtg aggtagaaaa 1620ttagaaatac ttcctaattc ttctcaaggc tgttggtaac tttatttcag ataattggag 1680agtaaaatgt taaaacctgt tgagaggaat tgatggtttc tgagaaatac taggtacatt 1740catcctcaca gattgcaaag gtgatttggg tgggggttta gtaattttct gcttaaaaaa 1800tgagtatctt gtaaccatta cctatatgct aaatattctt gaacaattag tagatccaga 1860aagaaaaaaa aaatatgctt tctctgtgtg tgtacctgtt gtatgtccta aacttattag 1920aaaattttat atactttttt acatgttggg gggcagaagg taaagccatg ttttgacttg 1980gtgaaaatgg ggttgtcaaa cagcccatta agctccctgg tatttcacct tcctgtccat 2040ctctcccctc cctccggtat acctttatcc ctttgaaagg gtgcttgtac aatttgatat 2100attttattga agagttatct cttattctga attaaattaa gcatttgttt tattgcagta 2160aagtttgtcc aaactcacaa ttaaaaaaaa aaaaaaaaa 2199 65 1496 DNA Homo sapiens65 tcactaaagg gaacaaaagc tggagctcca ccgcggtggc ggcccctcag aactagtgga 60tcccccgggc tgcaaggaat tcggcacgag cgcgcgtcct gcccgtctgt ccccgcgggg 120gtcgcccgcc acagcccgcg gaatgaccac ccagcagata gacctccagg gcccggggcc 180gtggggcttc cgcctcgtgg ggcgaaagga cttcgagcag cctctcgcca tttcccgggt 240cactcctgga agcaaggcgg ctctagctaa tttatgtatt ggagatgtaa tcacagccat 300tgatggggaa aatactagca atatgacaca cttggaagct cagaacagaa tcaaaggctg 360cacagacaac ttgactctca ctgtagccag atctgaacat aaagtctggt ctcctctggt 420gacggaggaa gggaagcgtc atccatacaa gatgaattta gcctctgaac cccaggaggt 480cctgcacata ggaagcgccc acaaccgaag tgccatgccc tttaccgcct cgcctgcctc 540cagcactact gccagggtca tcacaaacca gtacaacaac ccagctggcc tctactcttc 600tgaaaatatc tccaacttca acaatgccct ggagtcaaag actgctgcca gcggggtgga 660ggcgaacagc agacccttag accatgctca gcctccaagc agccttgtca tcgacaaaga 720atctgaagtt tacaagatgc ttcaggagaa acaggagttg aatgagcccc cgaaacagtc 780cacgtctttc ttggttttgc aggaaatcct ggagtctgaa gaaaaagggg atcccaacaa 840gccctcagga ttcagaagtg ttaaagctcc tgtcactaaa gtggctgcgt cgattggaaa 900tgctcagaag ttgcctatgt gtgacaaatg tggcactggg attgttggtg tgtttgtgaa 960gctgcgggac cgtcaccgcc accctgagtg ttatgtgtgc actgactgtg gcaccaacct 1020gaaacagaag ggccatttct ttgtggagga tcaaatctac tgtgagaagc atgcccggga 1080gcgagtcaca ccacctgagg gttatgaagt ggtcactgtg ttccccaagt gagccagcag 1140atctgaccac tgttctccag caggcctctg ctgcagcttt tctctcagtg ttctggccct 1200ctcctctctt gaaagttctc tgcttacttt ggttttccct ctgcttgtaa aacattgagg 1260cccctccctg ccttggttaa ttgactcaca ccagctgtgg gatgcccgct tttacaatta 1320aaggaaaact gttgtgttca gtgtcacctt gtcagcaaca ctgtgtccct tcgcccgccg 1380ttcttctctg ctgcatttgg acatcagcca aatttgaacc caatcaaata taacgtgtct 1440gacactgatt ttgtttttac tcaataaatg tatagactac aaaaaaaaaa aaaaaa 1496 665421 DNA Homo sapiens 66 ccgggatccg gttttttttg tttttaaaag tgtaatttcctttttatttg catctgttta 60 tgactgaaaa aaatgactag ttattatgaa gacactactgttgaagatgg atattttaac 120 atggagtttc aacaaaatta cttcttgaga cagagctgatgtgtttttta aataacgtga 180 ttttaagcat atatttgaac aaaactaaaa catttagtattatgaatatg aaaaaagatc 240 agtaaatcaa tgtactcttc taggctgaat taaggtagactatttaaggt ttcaaaaaag 300 tttggctggg gcagaataag ttttacaaaa cccatgccatccaaaattaa gatgacatgt 360 agcagcaaga agtattccaa tgtctcataa ccagttctcgcaagcaatgt gtattcctta 420 ctttaaggaa gtgtcaaaca aatagaaaaa tctggaagaatttactaagt gtaataaatt 480 agaggtaaat cgtaataaaa gaatttatgt ctcacaaaaatattcacaag tgggagtttt 540 cttttaccaa cttctcagag tccttctagc cccctcttcacttctgaaag atgggattta 600 ccaaaatctg gtttacattt aacttttcag ggacacatgacctgaaaaga aagatgtcag 660 ataatactga cattgcctca tgcactttct ttgtatcagtccttcttctg taagtaatca 720 gaattgggtc caaatggcat agaatcaaac attatgtatcatgccaaata ccacttcctg 780 cccaacaaaa tttcatcttt ctccagtaat gaagaggtggacattcttgt tggactgtag 840 catctgtgcc gcccgctcca caccaaccac ggcagctaacctctgggcat catatttgga 900 gtagagaaca gtgcaggtcc acgtggcctc ttctcctctgttggtggctc tcagcatatt 960 acagatttca ctgtaaaagt gtggatatgt cggcagttcatagaaaatca ggttcctgat 1020 gccttttatt gctgtagttt atttccaccc ccttccctcctgttttctct ctctccttct 1080 ctctctctct ctctctctct tttttttccg ccctagctggggctgtgttg gaggagagga 1140 agaaagagag acagaggatt gcattcatcc gttacgttcttgaaatttcc taatagcaag 1200 accagcgaag cggttgcacc cttttcaatc ttgcaaaggaaaaaaacaaa acaaaacaaa 1260 aaaaacccaa gtccccttcc cggcagtttt tgccttaaagctgccctctt gaaattaatt 1320 ttttcccagg agagagatgt cttatcaggg gaagaaaaatattccacgca tcacgagcga 1380 tcgtcttctg atcaaaggag gtaaaattgt taatgatgaccagtcgttct atgcagacat 1440 atacatggaa gatgggttga tcaagcaaat aggagaaaatctgattgtgc caggaggagt 1500 gaagaccatc gaggcccact cccggatggt gatccccggaggaattgacg tccacactcg 1560 tttccagatg cctgatcagg gaatgacgtc tgctgatgatttcttccaag gaaccaaggc 1620 ggccctggct gggggaacca ctatgatcat tgaccacgttgttcctgagc ctgggacaag 1680 cctgctcgct gcctttgacc agtggaggga atgggccgacagcaagtcct gctgtgacta 1740 ctctctgcat gtggacatca gcgagtggca taagggcatccaggaggaga tggaagcgct 1800 tgtgaaggat cacggggtaa attccttcct cgtgtacatggctttcaaag atcgcttcca 1860 gctaacggat tgccagattt atgaagtact gagtgtgatccgggatattg gcgccatagc 1920 ccaagtccac gcagaaaatg gcgacatcat tgcagaggagcagcagagga tcctggatct 1980 gggcatcacg ggccccgagg gacatgtgct gagccgacctgaggaggtcg aggccgaagc 2040 cgtgaatcgt gccatcacca tcgccaacca gaccaactgcccgctgtata tcaccaaggt 2100 gatgagcaaa agctctgctg aggtcatcgc ccaggcacggaagaagggaa ctgtggtgta 2160 tggcgagccc atcactgcca gcttgggaac ggacggctcccattactgga gcaagaactg 2220 ggccaaggct gctgcctttg tcacctcccc acccttgagccctgatccaa ccactccaga 2280 ctttctcaac tccttgctgt cctgtggaga cctccaggtcacgggcagtg cccattgcac 2340 gtttaacact gcccagaagg ctgtaggaaa ggacaacttcaccctgattc cggagggcac 2400 caatggcact gaggagcgga tgtccgtcat ctgggacaaggctgtggtca ctgggaagat 2460 ggatgagaac cagtttgtgg ctgtgaccag caccaatgcagccaaagtct tcaaccttta 2520 cccccggaaa ggccgcattg ctgtgggatc cgatgccgacctggtcatct gggaccccga 2580 cagcgttaaa accatctctg ccaagacaca caacagctctctcgagtaca acatctttga 2640 aggcatggag tgccgcggct ccccactggt ggtcatcagccaggggaaga ttgtcctgga 2700 ggacggcacc ctgcatgtca ccgaaggctc tggacgctacattccccgga agcccttccc 2760 tgattttgtt tacaagcgta tcaaggcaag gagcaggctggctgagctga gaggggttcc 2820 tcgtggcctg tatgacggac ctgtgtgtga agtgtctgtgacgcccaaga cagtcactcc 2880 agcctcctcg gccaagacgt ctcctgccaa gcagcaggccccacctgtcc ggaacctgca 2940 ccagtctgga ttcagtttgt ctggtgctca gattgatgacaacattcccc gccgcaccac 3000 ccagcgtatc gtggcgcccc ccggtggccg tgccaacatcaccagcctgg gctagagctc 3060 ctgggctgtg cgtccactgg ggactgggga tgggacacctgaggacattc tgagacttct 3120 ttcttccttc cttttttttt tttgtttttt tttttaagagcctgtgatag ttactgtgga 3180 gcagccagtt catggggtcc cccttgggcc cacaccccgtctctcaccaa gagttactga 3240 ttttgctcat ccacttccct acacatctat gggtatcacacccaagacta cccaccaagc 3300 tcatacaggg aaccacaccc aacacttaga catgcgaacaagcagccccc agcgagggtc 3360 tccttcgcct tcaacctcct agtgtctgtt agcattccttttcatggggg gagggaagat 3420 aaagtgaatt gcccagagct gcctttttct tttctttttaaaaattttaa gaagttttcc 3480 ttgtggggct ggggaggggc cggggtcagg gagagtcttttttttttttt ttttaaatac 3540 taaattggaa catttaattc catattaata caaggggtttgaactggaca tcctaatgat 3600 gcaattacgt catcacccag ctgattccgg gtggttggcaaactcatcgt gtctgtcctg 3660 agaggctcca caatgcccac ccgcatcgcc attctgtagtcttcagggtc agctgttgat 3720 aaaggggcag gcttgcgtta ttggcctaga ttttgctgcagattaaatcc tttgaggatt 3780 ctcttctctt ttaccatttt tctgcgtgct ctcactctctctttctctct ctagcttttt 3840 aattcatgaa tattttcgtg tctgtctctc tctctctctgtgtttcctcc agcccttgtc 3900 tcggagacgg tgttttcctc ccttgcccca ttatcttttcacctcccagg tctacatttc 3960 atggtggtcg ttgggtccgc ctaaaggatt tgagcgtttgccattgcaag catagtgctg 4020 tgtcatcctg gtccatgtag gactggtgct aaccacctgccatcatgagg atgtgtgcta 4080 gagtgtggga ccctggccaa gtgcaggaat gggccatgccgtctcaccca cagtatcaca 4140 cgtggaaccg cagacagggc ccagaagctt tagaggtatgaggctgcaga accggagaga 4200 ttttcctctg tgcagtgctc tctggctaaa gtcacggtcaaacctaaaca ccgagcctca 4260 ttaacccaag tgaaccaacc aaagtcacca gttcagaagtgctaagctaa taggagtctg 4320 acccgagggc ctgctgcttc ctggttaagt atcttttgagattctagaac acatgggagc 4380 tttttatttt cggggaaaaa ccgtattttt ttcttgtccaattatttcta aagacacact 4440 acatagaaag aggccctata aactcaaaaa gtcattgggaaacttaaagt ctattctact 4500 ttgccaagag gagaaatgtg ttttatgaac gatagatcacatcagaactc ctgtggggag 4560 gaaaccttat aaattaaaca catggccccc ttagagaccacaggcgatgt ctgtctccat 4620 ccttccctct ccttttctgt cacctttccc cctagctggctcctttggac ctacccctgt 4680 ccttgctgac ttgtgttgca ttgtattcca aacgtgtttacaggttctct taagcaatgt 4740 tgtatttgca ggcttttctg aataccaaat ctgctttttgtaaagcgtaa aaacatcaca 4800 aagtaggtca ttccatcacc acccttgtct ctctacacattttgcctttg gggatctggt 4860 tggggttttg ggttttttgt tgttgttgtt tatttgttattttaaaggta aattgcactt 4920 ttaaaaaaat aattggttga cttaatatat ttgctttttttctcacctgc acttagagga 4980 aatttgaaca agttggaaaa aaacaatttt tgtttcaattctaagaaaca cttgcagctc 5040 tagtattcac ttgagtcttc ctgtttttcc tgtaccgggtcatggtaatt tttggttgtt 5100 ttggttgttt tcttaaaaaa caagttaaaa cctgacgatttctgcagtga cttgatgctc 5160 taaaacagtg taggatttaa gaatagatgg tttttaatcctggaaattgt gattgtgacc 5220 catgagtgga ggaactttca gttctaaagc tgataaagtgtgtagccaga agagtacttt 5280 ttttttgtaa ccactgtctt gatggcaaaa taattatggtaaaaaacaag tctcgtgttt 5340 attattcctt aagaactctg tgttatatta ccatggaacgcctaataaag caaaatgtgg 5400 ttgtttcaaa aaaaaaaaaa a 5421 67 620 DNA Homosapiens 67 aaacatccta tcatctgtag gctcattcat ttctctaaca gcagcagcaacagcgcatca 60 caggacacca aggagagctc tgaagagcct ccctcagaag agagccaggacacccccatt 120 tacacggagt ttgatgagga tttcgaggag gaacccacat cccccataggtcactgtgtg 180 gccatctacc actttgaagg gtccagcgag ggcactatct ctatggccgagggtgaagac 240 ctcagtctta tggaagaaga caaaggggac ggctggaccc gggtcaggcggaaagaggga 300 ggcgagggct acgtgcccac ctcctacctc cgagtcacgc tcaattgaaccctgccagag 360 acgggaagag gggggctgtc ggctgctgct tctgggccac ggggagccccaggacctatg 420 cactttattt ctgaccccgt ggcttcggct gagacctgtg taacctgctgccccctccac 480 ccccaaccca gtcctacctg tcacaccgga cggacccgct gtgccttctaccatcgttcc 540 accattgatg tacatactca tgttttacat cttttctttc tgcgctcggctccggccatt 600 ttgttttata caaaaatggg 620 68 1266 DNA Homo sapiens 68ctcggaagcc cgtcaccatg tcgtgcgagt cgtctatggt tctcgggtac tgggatattc 60gtgggctggc gcacgccatc cgcctgctcc tggagttcac ggatacctct tatgaggaga 120aacggtacac gtgcggggaa gctcctgact atgatcgaag ccaatggctg gatgtgaaat 180tcaagctaga cctggacttt cctaatctgc cctacctcct ggatgggaag aacaagatca 240cccagagcaa tgccatcttg cgctacatcg ctcgcaagca caacatgtgt ggtgagactg 300aagaagaaaa gattcgagtg gacatcatag agaaccaagt aatggatttc cgcacacaac 360tgataaggct ctgttacagc tctgaccacg aaaaactgaa gcctcagtac ttggaagagc 420tacctggaca actgaaacaa ttctccatgt ttctgtggaa attctcatgg tttgccgggg 480aaaagctcac ctttgtggat tttctcacct atgatatctt ggatcagaac cgtatatttg 540accccaagtg cctggatgag ttcccaaacc tgaaggcttt catgtgccgt tttgaggctt 600tggagaaaat cgctgcctac ttacagtctg atcagttctg caagatgccc atcaacaaca 660agatggccca gtggggcaac aagcctgtat gctgagcagg aggcagactt gcagagcttg 720ttttgtttca tcctgtccgt aaggggtcag cgctcttgct ttgctctttt caatgaatag 780cacttatgtt actggtgtcc agctgagttt ctcttgggta taaaggctaa aagggaaaaa 840ggatatgtgg agaatcatca agatatgaat tgaatcgctg cgatactgtg gcatttccct 900actccccaac tgagttcaag ggctgtaggt tcatgcccaa gccctgagag tgggtactag 960aaaaaacgag attgcacagt tggagagagc aggtgtgtta aatggactgg agtccctgtg 1020aagactgggt gaggataaca caagtaaaac tgtggtactg atggacttaa ccggagttcg 1080gaaaccgtcc tgtgtacaca tgggagttta gtgtgataaa ggcagtattt cagactggtg 1140ggctagccaa tagagttggc aattgcttat tgaaactcat taaaaataat agagccccac 1200ttgacactat tcactaaaat taatctggaa tttaaggccc aacattaaac acaaagctgt 1260attgat 1266 69 3858 DNA Homo sapiens 69 agtctggttt aactggttgg aacgactaaagcacgctggc gcaaggaaag ctctcaactt 60 cgggagctga ggcgcaggct ggccagagcgtggagaggaa agccctttcc atcctcaagg 120 ccgttgcagg agatgcccgc gagccaccttcgccagcacc acaccggggt gtaatggata 180 ggtaacagag aagacctcgt cccttcctagtcagggcatc agcatgactg agtgcttcct 240 gccccccacc agcagcccca gtgaacaccgcagggtggag catggcagcg ggcttacccg 300 gacccccagc tctgaagaga tcagccctactaagtttcct ggattgtacc gcactggcga 360 gccctcacct ccccatgaca tcctccatgagcctcctgat gtagtgtctg atgatgagaa 420 agatcatggg aagaaaaaag ggaaatttaagaaaaaggaa aagaggactg aaggctatgc 480 agcctttcag gaagatagct ctggagatgaggcagaaagt ccttctaaaa tgaagaggtc 540 caagggaatc catgttttca agaagcccagcttttctaaa aagaaggaaa aggattttaa 600 aataaaagag aaacccaaag aagaaaagcataaagaagaa aagcacaaag aagaaaaaca 660 taaagagaag aagtcaaaag acttgacagcagctgatgtt gttaaacagt ggaaggaaaa 720 gaagaaaaag aaaaagccaa ttcaggagccagaggtgcct cagattgatg ttccaaatct 780 caaacccatt tttggaattc ctttggctgatgcagtagag aggaccatga tgtatgatgg 840 cattcggctg ccagccgttt tccgtgaatgtatagattac gtagagaagt atggcatgaa 900 gtgtgaaggc atctacagag tatcaggaattaaatcaaag gtggatgagc taaaagcagc 960 ctatgaccgg gaggagtcta caaacttggaagactatgag cctaacactg tagccagttt 1020 gctgaagcag tatttgcgag accttccagagaatttgctt accaaagagc ttatgcccag 1080 atttgaagag gcttgtggga ggaccacggagactgagaaa gtgcaggaat tccagcgttt 1140 actcaaagaa ctgccagaat gtaactatcttctgatttct tggctcattg tgcacatgga 1200 ccatgtcatt gcaaaggaac tggaaacaaaaatgaatata cagaacattt ctatagtgct 1260 cagcccaact gtgcagatca gcaatcgagtcctgtatgtg tttttcacac atgtgcaaga 1320 actctttgga aatgtggtac taaagcaagtgatgaaacct ctgcgatggt ctaacatggc 1380 cacgatgccc acgctgccag agacccaggcgggcatcaag gaggagatca ggagacagga 1440 gtttcttttg aattgtttac atcgagatctgcagggtggg ataaaggatt tgtctaaaga 1500 agaaagatta tgggaagtac aaagaattttgacagccctc aaaagaaaac tgagagaagc 1560 taaaagacag gagtgtgaaa ccaagattgcacaagagata gccagtcttt caaaagagga 1620 tgtttccaaa gaagagatga atgaaaatgaagaagttata aatattctcc ttgctcagga 1680 gaatgagatc ctgactgaac aggaggagctcctggccatg gagcagtttc tgcgccggca 1740 gattgcctca gaaaaagaag agattgaacgcctcagagct gagattgctg aaattcagag 1800 tcgccagcag cacggccgaa gtgagactgaggagtactcc tccgagagcg agagcgagag 1860 tgaggatgag gaggagctgc agatcattctggaagactta cagagacaga acgaagagct 1920 ggaaataaag aacaatcatt tgaatcaagcaattcatgag gagcgcgagg ccatcatcga 1980 gctgcgcgtg cagctgcggc tgctccagatgcagcgagcc aaggccgagc agcaggcgca 2040 ggaggacgag gagcctgagt ggcgcgggggtgccgtccag ccgcccagag acggcgtcct 2100 tgagccaaaa gcagctaaag agcagccaaaggcaggcaag gagccggcaa agccatcgcc 2160 cagcagggat aggaaggaga cgtccatctgagcagcctgc gtggccgtct ggagtccgtg 2220 agactgaaag gacccgtgca tcttactgtaacccgggggc caggccggct ctctcgctgt 2280 acattctgta aaggtgtctt ctcttctcagactcttcctc tgtcacacgt ctgactcctt 2340 cacgtcaggc tcaggttcca tgggaggacgaagcagtgga cgcattgtgg gctttaggga 2400 cagatgagtt ttccagatag tgtcagcttatttgaagatt aattttcttt gttaacttaa 2460 aataactatt ttaacccttg agtggcttctttttaaacca aaaaccgtct ttctttgctt 2520 ttttatcaca gcagaatcag gatctctttctcattcaagg ggggaaccac accaggtcag 2580 cgctgcgcct gctgtggccg ccgcgagccacgccctctgg gatctctggt accgtcactc 2640 ttgcttgtgc cttccacacc ttctcggtgcagatccctat gggggagctg cctcacgttc 2700 tctgactggt cagagcagcg cctggtgggtgttccctggc ccactctcct ctctccttct 2760 gcagttctaa accacagtct ataagcccgagtcaccagga cggcctgtct ggccacagac 2820 aggggctgcc tgtggagcct gcccaccggcccccggcagt gcagtccagc ggggaggagg 2880 ctgcccgttc ctgccagttc ctcactgcggggaccagcaa aggccttctc actgggttgg 2940 tcaaaggtag tcaccttggc ctggtgcatccacagaggat gttgttcaaa ccagaaatct 3000 tttaaacgac tgaccttcct taaaaacagaatgactccga ttgcttgctt gggctagaat 3060 gtacacgtct ccttgcctga ataagccatatatatgctct taaacaaaag tttgaaatta 3120 tccatatcat ctcagtgaac ctactggtggactcccaatt gacaagattg agcaatagaa 3180 aaaaattcct ttcctttgaa tgatagctgtgattcacccc accccatttt cttgtttctg 3240 gtccatccga tgagacggat gctctgatgctctgaggctt ctgggaggct gggccctgga 3300 ggcaacgtgc tgcaggcgca ctctgtcagagtgaacagca ccgcgagaca ggccaggctc 3360 gtggctcgga agacaaaccc cacacacactcaaggggtcg aaaacaaacc ccacacgagg 3420 gctctcacct ccttctccta ggtagtatttattttcagca cctgtttgat gcagttttta 3480 atcctctacc tattgcactg ttgtgactcgttggccatta tttgattttg gtacgaaaaa 3540 aagctttgtt atagaaatca gcatactatttttttaaatc tggagagaag atattctggt 3600 gactgaaagt atggtcgggt gtcagatataaatgtgcaaa tgccttcttg ctgtcctgtc 3660 ggtctcagta cgttcacttt atagctgctggcaatatcga aggttccttt tttgtttgtg 3720 taaactctaa tttctatcaa ggtgtcatggatttttaaaa ttagtatttc attacaaatg 3780 tctcagcatt ggttaactaa ttttgggcaggaccattatt gatcaagcaa ataaattcaa 3840 cagccatttg ggaaaaag 3858 70 4043DNA Homo sapiens 70 cgaagcgggt cctgccccgc tgtcagctgc ggcccccggcgccgggcggg ggtggccgcg 60 accattggcg gagaggcgaa aggggcgggg ccgccgccagccgctgcggg caaggctgaa 120 caggcggagg tgggcagccg gccagggaag cacggtccaggcggctacat tcggcccggc 180 catggcagcg gcgcccctga aagtgtgcat cgtgggctcggggaactggg gttcagctgt 240 tgcaaaaata attggtaata acgtcaagaa acttcagaaatttgcctcca cagtcaagat 300 gtgggtcttt gaagaaacag tgaatggcag aaaactgacagacatcataa ataatgacca 360 tgaaaatgta aaatatcttc ctggacacaa gctgccagaaaatgtggttg ccatgtcaaa 420 tcttagcgag gctgtgcagg atgcagacct gctggtgtttgtcattcccc accagttcat 480 tcacagaatc tgtgatgaga tcactgggag agtgcccaagaaagcgctgg gaatcaccct 540 catcaagggc atagacgagg gccccgaggg gctgaaactcatttctgaca tcatccgtga 600 gaagatgggt attgacatca gtgtgctgat gggagccaacattgccaatg aggtggctgc 660 agagaagttc tgtgagacca ccatcggcag caaagtaatggagaacggcc ttctcttcaa 720 agaacttctg cagactccaa attttcgaat tacggtggttgatgatgcag acactgttga 780 actctgtggt gcgcttaaga acatcgtagc tgtgggagctgggttctgcg acggcctccg 840 ctgtggagac aacaccaaag cggccgtcat ccgcctgggactcatggaaa tgattgcttt 900 tgccaggatc ttctgcaaag gccaagtgtc tacagccaccttcctagaga gctgcggggt 960 ggccgacctg atcaccacct gttacggagg gcggaaccgcagggtggccg aggccttcgc 1020 cagaactggg aagaccattg aagagttgga gaaggagatgctgaatgggc aaaagctcca 1080 aggaccgcag acttctgctg aagtgtaccg catcctcaaacagaagggac tactggacaa 1140 gtttccattg tttactgcag tgtatcagat ctgctacgaaagcagaccag ttcaagagat 1200 gttgtcttgt cttcagagcc atccagagca tacataaagtgaatcatgca acgtgttggg 1260 ggaagttctg cctttctgat caatcttttg ggttcacgtggaaaccagga cttggcaaca 1320 tgatgtttga ctgtaatctc atcacggata tgtatgaatttttacaggtt cgtttttgaa 1380 ttgtgagagg cagttcatta gcaaagatgt actgggcagtaactaaacac acatgcaaac 1440 atgtgaatgg tggtttattc ctcattctgt ggatgtttctatgagccaaa atttgatgtc 1500 tttttttcaa aattgcttat gaaatttcca cacaatcgtagcttataaga ttggaacgat 1560 ctcagccaaa tattttaggt gtaattcata tgtatttgagtggaggattt tttttctcat 1620 ttttctagtg ttaaatttta accagcatta acatggtagagtggaggagt gagtgtgttc 1680 aaagatcaac atatttaact tttaaacact atctcaaagccagcataatt aactactttg 1740 attgtgggct gacctttgtt tttttaacaa tcaggcatttttaattagat aatccactca 1800 tgtatttccc cctcactgca gttgtctgca tttttagcctcttttctctt cgttagttgt 1860 cagaatatgc ctttgtcaag gctcagaggt aacaagacagaaaattcatc tgggattttc 1920 ctgctgtggc tggcacattc ttctgattaa cagacacttgtatgatgctt taggctagtt 1980 agtgcatttt ttagcaaaca tttatcttaa acatcacagatccactgggg ggtgcaaggg 2040 gctactgtta gtcctcttgt tagatgcagt cactcctcctggtcacctag tgagcaggga 2100 cagagccagg agtcaagtgc agtgccaagg tgcatgaccctctgagaagt cactgggctg 2160 atttgacctc cgactcattg gttgtgtaaa tgccatgtgcagcctttcct gaggccatag 2220 gagggcttcc tgcagctgag atctatgcag gccatcctctcaacaggtgc cactccaagg 2280 gcggtcctcg gtgcagcagc atcagcttca cttgtgggggggtgggggaa ggggcggtct 2340 cagaaatgca ggttcccagg tcccaccctg gacttctgaaggggtgtggc atctgtgttt 2400 ctgatgctta ctacaatatg tgaaccacta ctttagaaaatctgctttaa cttggtattc 2460 ctctaattgt gttccctagg aaatgactgt cccaagagccagtgattatt ccaggtgttc 2520 cctggaaagg tcaagtgagt ctgggaaaca ctatgtctgtacacctcttg aaggtgtcga 2580 atgtatgttt atacatcagt ggaacccatt tttctagcctagcaagtccc aaacacatta 2640 cactgaagag attttggtga ggaaacttgc tggagttttcagggaacact gttctaggct 2700 taggtgacct taggatcact caagtagacc cttcactccctgcgagaaat taggatgaat 2760 aactacctgt ggcattgttg gttctgaact tttacagttcaggcctgctg tgaatctttg 2820 atgaagcttt aaggtgacac tgttgtacaa gatgtcagctttgctgaaac gcacattacc 2880 tggaataagt gctttaattg tagaattaga atgggatttactgtactgtt ttaaatgaga 2940 ttggcttcag aatccattac agttacctta catagcacttgatacgtgtt aaatgaacat 3000 atgaatgtaa tttatatatt cctagaattt aagttactttgtgagatttg ggcctgtccc 3060 tcaatgccag tttaggattt ctttttttct ataccttgaaatgattataa aatagatttt 3120 catgggaatt ttaaaaactc tatccaaaac atttttggagcattttaaag ccccatacac 3180 agaagtatac gaaagcacac aaaacactcc aagtttcagcagttttagcg ccaccattaa 3240 cccactttgc ttgtctcatg aaaaatcttt gttaaagtttgtacacaggt aacaaaaagt 3300 tactttaaaa gatatataaa gggctgtaag ctaattgtggtgtctagtaa gtagcataat 3360 gagatgtgag gagttggaac tttgcgtgtt ttgcgtattttcatctgcat tcagcttctt 3420 actctgggtt tgtactcgag tgttatttct ttacaaatgcccttgtaatt accactctga 3480 agtctgctga ctgtgtctct tgaacatact taggatattctgcacattat ggaaaaaggt 3540 aaattttaga agtttctgct ctactaactg tagatatttatgactctgcg agttatctat 3600 ttttataacc acctgtggtc cattgttcat tttaattcacatttcttatg aagtatggta 3660 acagggaggg agacacctag attagcagct caatttgtactacttcagcc aatctgtgaa 3720 tgtaaaaact acactgttgc cttgctagga tccaccctcctataatatgg aacaaatatc 3780 tgaatgaaat ccaccctagg agacggagtc aaactaaacttgtggttttt catttaactt 3840 ttgactacag catggcccca tggcatccac accaagagggtgttgtgatg aggtgccggt 3900 gtgcaaaggg aactttagtt tttccactgg ttcttatctgctagcctttt acatacatgt 3960 gtactatatt tgtttataga ctgtaggtgg atatataatttaaaagcttg atttaataaa 4020 catttaaccc cctaaacttg ggg 4043 71 2108 DNAHomo sapiens 71 tgttcctcct ccgtcccacc cccataacta tactggctct gatgagaccttggttttctg 60 taaaagctct atttagaggt gtatcattat ttacttaatt gttctcctttacaacccacc 120 tgggatgagc atcttgccta gaagtctcta cttgcacagg atacatacgaaatagattga 180 ggattcaaag cagatacaga actcttccca cttactttct taccctgtgtgtctccccac 240 agggttacaa gtgtataaca agtgttggaa gtttgagcat tgcaatttcaacgacgtcac 300 aacccgcttg agggaaaatg agctaacgta ctactgctgc aagaaggacctgtgtaactt 360 taacgaacag cttgaaaatg gtgggacatc cttatcagag aaaacagttcttctgctggt 420 gactccattt ctggcagcag cctggagcct tcatccctaa gtcaacaccaggagagcttc 480 tcccaaactc cccgttcctg cgtagtccgc tttctcttgc tgccacattctaaaggcttg 540 atattttcca aatggatcct gttgggaaag aataaaatta gcttgagcaacctggctaag 600 atagaggggc tctgggagac tttgaagacc agtcctgttt gcagggaagccccacttgaa 660 ggaagaagtc taagagtgaa gtaggtgtga cttgaactag attgcatgcttcctcctttg 720 ctcttgggaa gaccagcttt gcagtgacag cttgagtggg ttctctgcagccctcagatt 780 atttttcctc tggctccttg gatgtagtca gttagcatca ttagtacatctttggagggt 840 ggggcaggag tatatgagca tcctctctca catggaacgc tttcataaacttcagggatc 900 ccgtgttgcc atggaggcat gccaaatgtt ccatatgtgg gtgtcagtcagggacaacaa 960 gatccttaat gcagagctag aggacttctg gcagggaagt ggggaagtgttccagatagc 1020 agggcatgaa aacttagaga ggtacaagtg gctgaaaatc gagtttttcctctgtcttta 1080 aattttatat gggctttgtt atcttccact ggaaaagtgt aatagcatacatcaatggtg 1140 tgttaaagct atttccttgc ctttttttta ttggaatggt aggatatcttggctttgcca 1200 cacacagtta cagagtgaac actctactac atgtgactgg cagtattaagtgtgcttatt 1260 ttaaatgtta ctggtagaaa ggcagttcag gtatgtgtgt atatagtatgaatgcagtgg 1320 ggacaccctt tgtggttaca gtttgagact tccaaaggtc atccttaataacaacagatc 1380 tgcaggggta tgttttacca tctgcatcca gcctcctgct aactcctagctgactcagca 1440 tagattgtat aaaatacctt tgtaacggct cttagcacac tcacagatgtttgaggcttt 1500 cagaagctct tctaaaaaat gatacacacc tttcacaagg gcaaactttttccttttccc 1560 tgtgtattct agtgaatgaa tctcaagatt cagtagacct aatgacatttgtattttatg 1620 atcttggctg tatttaatgg cataggctga cttttgcaga tggaggaatttcttgattaa 1680 tgttgaaaaa aaacccttga ttatactctg ttggacaaac cgagtgcaatgaatgatgct 1740 tttctgaaaa tgaaatataa caagtgggtg aatgtggtta tggccgaaaaggatatgcag 1800 tatgcttaat ggtagcaact gaaagaagac atcctgagca gtgccagctttcttctgttg 1860 atgccgttcc ctgaacatag gaaaatagaa acttgcttat caaaacttagcattaccttg 1920 gtgctctgtg ttctctgtta gctcagtgtc tttccttaca tcaataggtttttttttttt 1980 tttttggcct gaggaagtac tgaccatgcc cacagccacc ggctgagcaaagaagctcat 2040 ttcatgtgag ttctaaggaa tgagaaacaa ttttgatgaa tttaagcagaaaatgaattt 2100 ctgggaac 2108 72 1938 DNA Homo sapiens 72 attccggttgttgcaccatg gcgtccatgg ggaccctcgc cttcgatgaa tatgggcgcc 60 ctttcctcatcatcaaggat caggaccgca agtcccgtct tatgggactt gaggccctca 120 agtctcatataatggcagca aaggctgtag caaatacaat gagaacatca cttggaccaa 180 atgggcttgataagatgatg gtggataagg atggagatgt gactgtaact aatgatgggg 240 ccaccatcttaagcatgatg gatgttgatc atcagattgc caagctgatg gtggaactgt 300 ccaagtctcaggatgatgaa attggagatg gaaccacagg agtggttgtc ctggctggtg 360 ccttgttagaagaagcggag caattgctag accgaggcat tcacccaatc agaatagccg 420 atggctatgagcaggctgct cgtgttgcta ttgaacacct ggacaagatc agcgatagcg 480 tccttgttgacataaaggac accgaacccc tgattcagac agcaaaaacc acgctgggct 540 ccaaagtggtcaacagttgt caccgacaga tggctgagat tgctgtgaat gccgtcctca 600 ctgtagcagatatggagcgg agagacgttg actttgagct tatcaaagta gaaggcaaag 660 tgggcggcaggctggaggac actaaactga ttaagggcgt gattgtggac aaggatttca 720 gtcacccacagatgccaaaa aaagtggaag atgcgaagat tgcaattctc acatgtccat 780 ttgaaccacccaaaccaaaa acaaagcata agctggatgt gacctctgtc gaagattata 840 aagcccttcagaaatacgaa aaggagaaat ttgaagagat gattcaacaa attaaagaga 900 ctggtgctaacctagcaatt tgtcagtggg gctttgatga tgaagcaaat cacttacttc 960 ttcagaacaacttgcctgcg gttcgctggg taggaggacc tgaaattgag ctgattgcca 1020 tcgcaacaggagggcggatc gtccccaggt tctcagagct cacagccgag aagctgggct 1080 ttgctggtcttgtacaggag atctcatttg ggacaactaa ggataaaatg ctggtcatcg 1140 agcagtgtaagaactccaga gctgtaacca tttttattag aggaggaaat aagatgatca 1200 ttgaggaggcgaaacgatcc cttcacgatg ctttgtgtgt catccggaac ctcatccgcg 1260 ataatcgtgtggtgtatgga ggaggggctg ctgagatatc ctgtgccctg gcagttagcc 1320 aagaggcggataagtgcccc accttagaac agtatgccat gagagcgttt gccgacgcac 1380 tggaggtcatccccatggcc ctctctgaaa acagtggcat gaatcccatc cagactatga 1440 ccgaagtccgagccagacag gtgaaggaga tgaaccctgc tcttggcatc gactgtttgc 1500 acaaggggacaaatgatatg aagcaacagc atgtcataga aaccttgatt ggcaaaaagc 1560 aacagatatctcttgcaaca caaatggtta gaatgatttt gaagattgat gacattcgta 1620 agcctggagaatctgaagaa tgaagacatt gagaaaacta tgtagcaaga tccacttctg 1680 tgattaagtaaatggatgtc tcgtgatgca tctacagtta tttattgtta catccttttc 1740 cagacactgtagatgctata ataaaaatag ctgtttggta accatagttt cacttgttca 1800 aagctgtgtaatcgtggggg taccatctca actgcttttg tattcattgt attaaaagaa 1860 tctgtttaaacaacctttat cttctcttcg ggtttaagaa acgtttattg taacagtaat 1920 taaatgctgccttaattg 1938 73 1231 DNA Homo sapiens 73 aggtctcagc cggtcgtcgcgacgttcgcc cgctcgctct gaggctcctg aagccgaaac 60 tagctagact ttcctccttcccgcctgcct gtagcggcgt tgttgccact ccgccaccat 120 gttcgaggcg cgcctggtccagggctccat cctcaagaag gtgttggagg cactcaagga 180 cctcatcaac gaggcctgctgggatattag ctccagcggt gtaaacctgc agagcatgga 240 ctcgtcccac gtctctttggtgcagctcac cctgcggtct gagggcttcg acacctaccg 300 ctgcgaccgc aacctggccatgggcgtgaa cctcaccagt atgtccaaaa tactaaaatg 360 cgccggcaat gaagatatcattacactaag ggccgaagat aacgcggata ccttggcgct 420 agtatttgaa gcaccaaaccaggagaaagt ttcagactat gaaatgaagt tgatggattt 480 agatgttgaa caacttggaattccagaaca ggagtacagc tgtgtagtaa agatgccttc 540 tggtgaattt gcacgtatatgccgagatct cagccatatt ggagatgctg ttgtaatttc 600 ctgtgcaaaa gacggagtgaaattttctgc aagtggagaa cttggaaatg gaaacattaa 660 attgtcacag acaagtaatgtcgataaaga ggaggaagct gttaccatag agatgaatga 720 accagttcaa ctaacttttgcactgaggta cctgaacttc tttacaaaag ccactccact 780 ctcttcaacg gtgacactcagtatgtctgc agatgtaccc cttgttgtag agtataaaat 840 tgcggatatg ggacacttaaaatactactt ggctcccaag atcgaggatg aagaaggatc 900 ttaggcattc ttaaaattcaagaaaataaa actaagctct ttgagaactg cttctaagat 960 gccagcatat actgaagtcttttctgtcac caaatttgta cctctaagta catatgtaga 1020 tattgttttc tgtaaataacctattttttt tctctattct ctccaatttg tttaaagaat 1080 aaagtccaaa gtctgatctggtctagttaa cctagaagta tttttgtctc ttagaaatac 1140 ttgtgatttt tataatacaaaagggtcttg actctaaatg cagttttaag aagtgttttt 1200 gaatttaaat aaagttacttgaatttcaaa c 1231 74 2025 DNA Homo sapiens 74 cggcacgagg caccccgagaggagaagcgc agcgcagtgg cgagaggagc cccttgtggc 60 agcagcacta cctgcccagaaaaatgctgg aggctgggcg tggccccagg cctggggacc 120 tgtttttcct gtttcccgcagagttccctg cagcccggtc caggtccagg cgtgtgcatt 180 catgagtgag gaacccgtgcaggcgctgag catcctgacc tggagagcag gggctggtca 240 gggcgatggc agcagacctgggcccctgga atgacaccat caatggcacc tgggatgggg 300 atgagctggg ctacaggtgccgcttcaacg aggacttcaa gtacgtgctg ctgcctgtgt 360 cctacggcgt ggtgtgcgtgcttgggctgt gtctgaacgc cgtggcgctc tacatcttct 420 tgtgccgcct caagacctggaatgcgtcca ccacatatat gttccacctg gctgtgtctg 480 atgcactgta tgcggcctccctgccgctgc tggtctatta ctacgcccgc ggcgaccact 540 ggcccttcag cacggtgctctgcaagctgg tgcgcttcct cttctacacc aacctttact 600 gcagcatcct cttcctcacctgcatcagcg tgcaccggtg tctgggcgtc ttacgacctc 660 tgcgctccct gcgctggggccgggcccgct acgctcgccg ggtggccggg gccgtgtggg 720 tgttggtgct ggcctgccaggcccccgtgc tctactttgt caccaccagc gcgcgcgggg 780 gccgcgtaac ctgccacgacacctcggcac ccgagctctt cagccgcttc gtggcctaca 840 gctcagtcat gctgggcctgctcttcgcgg tgccctttgc cgtcatcctt gtctgttacg 900 tgctcatggc tcggcgactgctaaagccag cctacgggac ctcgggcggc ctccctaggg 960 ccaagcgcaa gtccgtgcgcaccatcgccg tggtgctggc tgtcttcgcc ctctgcttcc 1020 tgccattcca cgtcacccgcaccctctact actccttccg ctcgctggac ctcagctgcc 1080 acaccctcaa cgccatcaacatggcctaca aggttacccg gccgctggcc agtgctaaca 1140 gttgccttga ccccgtgctctacttcctgg ctgggcagag gctcgtacgc tttgcccgag 1200 atgccaagcc acccactggccccagccctg ccaccccggc tcgccgcagg ctgggcctgc 1260 gcagatccga cagaactgacatgcagagga taggagatgt gttgggcagc agtgaggact 1320 tcaggcggac agagtccacgccggctggta gcgagaacac taaggacatt cggctgtagg 1380 agcagaacac ttcagcctgtgcaggtttat attgggaagc tgtagaggac caggacttgt 1440 gcagacgcca cagtctccccagatatggac catcagtgac tcatgctgga tgaccccatg 1500 ctccgtcatt tgacaggggctcaggatatt cactctgtgg tccagagtca actgttccca 1560 taacccctag tcatcgtttgtgtgtataag ttgggggaat taagtttcaa gaaaggcaag 1620 agctcaaggt caatgacacccctggcctga ctcccatgca agtagctggc tgtactgcca 1680 aggtacctag gttggagtccagcctaatca agtcaaatgg agaaacaggc ccagagagga 1740 aggtggctta ccaagatcacataccagagt ctggagctga gctacctggg gtgggggcca 1800 agtcacaggt tggccagaaaaccctggtaa gtaatgaggg ctgagtttgc acagtggtct 1860 ggaatggact gggtgccacggtggacttag ctctgaggag tacccccagc ccaagagatg 1920 aacatctggg gactaatatcatagacccat ctggaggctc ccatgggcta ggagcagtgt 1980 gaggctgtaa cttatactaaaggttgtgtt gcctgctaaa aaaaa 2025 75 4910 DNA Homo sapiens 75 tagacgcaccctctgaagat ggtgactccc tcctgagaag ctggacccct tggtaaaaga 60 caaggccttctccaagaaga atatgaaagt gttactcaga cttatttgtt tcatagctct 120 actgatttcttctctggagg ctgataaatg caaggaacgt gaagaaaaaa taattttagt 180 gtcatctgcaaatgaaattg atgttcgtcc ctgtcctctt aacccaaatg aacacaaagg 240 cactataacttggtataaag atgacagcaa gacacctgta tctacagaac aagcctccag 300 gattcatcaacacaaagaga aactttggtt tgttcctgct aaggtggagg attcaggaca 360 ttactattgcgtggtaagaa attcatctta ctgcctcaga attaaaataa gtgcaaaatt 420 tgtggagaatgagcctaact tatgttataa tgcacaagcc atatttaagc agaaactacc 480 cgttgcaggagacggaggac ttgtgtgccc ttatatggag ttttttaaaa atgaaaataa 540 tgagttacctaaattacagt ggtataagga ttgcaaacct ctacttcttg acaatataca 600 ctttagtggagtcaaagata ggctcatcgt gatgaatgtg gctgaaaagc atagagggaa 660 ctatacttgtcatgcatcct acacatactt gggcaagcaa tatcctatta cccgggtaat 720 agaatttattactctagagg aaaacaaacc cacaaggcct gtgattgtga gcccagctaa 780 tgagacaatggaagtagact tgggatccca gatacaattg atctgtaatg tcaccggcca 840 gttgagtgacattgcttact ggaagtggaa tgggtcagta attgatgaag atgacccagt 900 gctaggggaagactattaca gtgtggaaaa tcctgcaaac aaaagaagga gtaccctcat 960 cacagtgcttaatatatcgg aaattgaaag tagattttat aaacatccat ttacctgttt 1020 tgccaagaatacacatggta tagatgcagc atatatccag ttaatatatc cagtcactaa 1080 tttccagaagcacatgattg gtatatgtgt cacgttgaca gtcataattg tgtgttctgt 1140 tttcatctataaaatcttca agattgacat tgtgctttgg tacagggatt cctgctatga 1200 ttttctcccaataaaagctt cagatggaaa gacctatgac gcatatatac tgtatccaaa 1260 gactgttggggaagggtcta cctctgactg tgatattttt gtgtttaaag tcttgcctga 1320 ggtcttggaaaaacagtgtg gatataagct gttcatttat ggaagggatg actacgttgg 1380 ggaagacattgttgaggtca ttaatgaaaa cgtaaagaaa agcagaagac tgattatcat 1440 tttagtcagagaaacatcag gcttcagctg gctgggtggt tcatctgaag agcaaatagc 1500 catgtataatgctcttgttc aggatggaat taaagttgtc ctgcttgagc tggagaaaat 1560 ccaagactatgagaaaatgc cagaatcgat taaattcatt aagcagaaac atggggctat 1620 ccgctggtcaggggacttta cacagggacc acagtctgca aagacaaggt tctggaagaa 1680 tgtcaggtaccacatgccag tccagcgacg gtcaccttca tctaaacacc agttactgtc 1740 accagccactaaggagaaac tgcaaagaga ggctcacgtg cctctcgggt agcatggaga 1800 agttgccaagagttctttag gtgcctcctg tcttatggcg ttgcaggcca ggttatgcct 1860 catgctgacttgcagagttc atggaatgta actatatcat cctttatccc tgaggtcacc 1920 tggaatcagattattaaggg aataagccat gacgtcaata gcagcccagg gcacttcaga 1980 gtagagggcttgggaagatc ttttaaaaag gcagtaggcc cggtgtggtg gctcacgcct 2040 ataatcccagcactttggga ggctgaagtg ggtggatcac cagaggtcag gagttcgaga 2100 ccagcccagccaacatggca aaaccccatc tctactaaaa atacaaaaat gagctaggca 2160 tggtggcacacgcctgtaat cccagctaca cctgaggctg aggcaggaga attgcttgaa 2220 ccggggagacggaggttgca gtgagccgag tttgggccac tgcactctag cctggcaaca 2280 gagcaagactccgtctcaaa aaaagggcaa taaatgccct ctctgaatgt ttgaactgcc 2340 aagaaaaggcatggagacag cgaactagaa gaaagggcaa gaaggaaata gccaccgtct 2400 acagatggcttagttaagtc atccacagcc caagggcggg gctatgcctt gtctggggac 2460 cctgtagagtcactgaccct ggagcggctc tcctgagagg tgctgcaggc aaagtgagac 2520 tgacacctcactgaggaagg gagacatatt cttggagaac tttccatctg cttgtatttt 2580 ccatacacatccccagccag aagttagtgt ccgaagaccg aattttattt tacagagctt 2640 gaaaactcacttcaatgaac aaagggattc tccaggattc caaagttttg aagtcatctt 2700 agctttccacaggagggaga gaacttaaaa aagcaacagt agcagggaat tgatccactt 2760 cttaatgctttcctccctgg catgaccatc ctgtcctttg ttattatcct gcattttacg 2820 tctttggaggaacagctccc tagtggcttc ctccgtctgc aatgtccctt gcacagccca 2880 cacatgaaccatccttccca tgatgccgct cttctgtcat cccgctcctg ctgaaacacc 2940 tcccaggggctccacctgtt caggagctga agcccatgct ttcccaccag catgtcactc 3000 ccagaccacctccctgccct gtcctccagc ttcccctcgc tgtcctgctg tgtgaattcc 3060 caggttggcctggtggccat gtcgcctgcc cccagcactc ctctgtctct gctcttgcct 3120 cgacccttcctcctcctttg cctaggaggc cttctcgcat tttctctagc tgatcagaat 3180 tttaccaaaattcagaacat cctccaattc cacagtctct gggagacttt ccctaagagg 3240 cgacttcctctccagccttc tctctctggt caggcccact gcagagatgg tggtgagcac 3300 atctgggaggctggtctccc tccagctgga attgctgctc tctgagggag aggctgtggt 3360 ggctgtctctgtccctcact gccttccagg agcaatttgc acatgtaaca tagatttatg 3420 taatgctttatgtttaaaaa cattccccaa ttatcttatt taatttttgc aattattcta 3480 attttatatatagagaaagt gacctatttt ttaaaaaaat cacactctaa gttctattga 3540 acctaggacttgagcctcca tttctggctt ctagtctggt gttctgagta cttgatttca 3600 ggtcaataacggtcccccct cactccacac tggcacgttt gtgagaagaa atgacatttt 3660 gctaggaagtgaccgagtct aggaatgctt ttattcaaga caccaaattc caaacttcta 3720 aatgttggaattttcaaaaa ttgtgtttag attttatgaa aaactcttct actttcatct 3780 attctttccctagaggcaaa catttcttaa aatgtttcat tttcattaaa aatgaaagcc 3840 aaatttatatgccaccgatt gcaggacaca agcacagttt taagagttgt atgaacatgg 3900 agaggacttttggtttttat atttctcgta tttaatatgg gtgaacacca acttttattt 3960 ggaataataattttcctcct aaacaaaaac acattgagtt taagtctctg actcttgcct 4020 ttccacctgctttctcctgg gcccgctttg cctgcttgaa ggaacagtgc tgttctggag 4080 ctgctgttccaacagacagg gcctagcttt catttgacac acagactaca gccagaagcc 4140 catggagcagggatgtcacg tcttgaaaag cctattagat gttttacaaa tttaattttg 4200 cagattattttagtctgtca tccagaaaat gtgtcagcat gcatagtgct aagaaagcaa 4260 gccaatttggaaacttaggt tagtgacaaa attggccaga gagtgggggt gatgatgacc 4320 aagaattacaagtagaatgg cagctggaat ttaaggaggg acaagaatca atggataagc 4380 gtgggtggaggaagatccaa acagaaaagt gcaaagttat tccccatctt ccaagggttg 4440 aattctggaggaagaagaca cattcctagt tccccgtgaa cttcctttga cttattgtcc 4500 ccactaaaacaaaacaaaaa acttttaatg ccttccacat taattagatt ttcttgcagt 4560 ttttttatggcattttttta aagatgccct aagtgttgaa gaagagtttg caaatgcaac 4620 aaaaatatttaattaccggt tgttaaaact ggtttagcac aatttatatt ttccctctct 4680 tgcctttcttatttgcaata aaaggtattg agccattttt taaatgacat ttttgataaa 4740 ttatgtttgtactagttgat gaaggagttt tttttaacct gtttatataa ttttgcagca 4800 gaagccaaattttttgtata ttaaagcacc aaattcatgt acagcatgca tcacggatca 4860 atagactgtacttattttcc aataaaattt tcaaactttg tactgttaaa 4910 76 2592 DNA Homosapiens 76 gccccacgca cggacaggag tgaacccgag ctgtgccgac caacccccaggatggcggaa 60 gctcaccagg ccgtggcctt ccagttcacg gtgaccccag acggggtcgacttccggctc 120 agtcgggagg ccctgaaaca cgtctacctg tctgggatca actcctggaagaaacgcctg 180 atccgcatca agaatggcat cctcaggggc gtgtaccctg gcagccccaccagctggctg 240 gtcgtcatca tggcaacagt gggttcctcc ttctgcaacg tggacatctccttggggctg 300 gtcagttgca tccagagatg cctccctcag gggtgtggcc cctaccagaccccgcagacc 360 cgggcacttc tcagcatggc catcttctcc acgggcgtct gggtgacgggcatcttcttc 420 ttccgccaaa ccctgaagct gcttctctgc taccatgggt ggatgtttgagatgcatggc 480 aagaccagca acttgaccag gatctgggct atgtgtatcc gccttctatccagccggcac 540 cctatgctct acagcttcca gacatctctg cccaagcttc ctgtgcccagggtgtcagcc 600 acaattcagc ggtacctaga gtctgtgcgc cccttgttgg atgatgaggaatattaccgc 660 atggagttgc tggccaaaga attccaggac aagactgccc ccaggctgcagaaatacctg 720 gtgctcaagt catggtgggc aagtaactat gtgagtgact ggtgggaagagtacatctac 780 cttcgaggca ggagccctct catggtgaac agcaactatt atgtcatggaccttgtgctc 840 atcaagaata cagacgtgca ggcagcccgc ctgggaaaca tcatccacgccatgatcatg 900 tatcgccgta aactggaccg tgaagaaatc aagcctgtga tggcactgggcatagtgcct 960 atgtgctcct accagatgga gaggatgttc aacaccactc ggatcccgggcaaggacaca 1020 gatgtgctac agcacctctc agacagccgg cacgtggctg tctaccacaagggacgcttc 1080 ttcaagctgt ggctctatga gggcgcccgt ctgctcaagc ctcaggatctggagatgcag 1140 ttccagagga tcctggacga cccctcccca cctcagcctg gggaggagaagctggcagcc 1200 ctcactgcag gaggaagggt ggagtgggcg caggcacgcc aggccttctttagctctgga 1260 aagaataagg ctgccttgga ggccatcgag cgtgccgctt tcttcgtggccctggatgag 1320 gaatcctact cctatgaccc cgaagatgag gccagcctca gcctctatggcaaggccctg 1380 ctacatggca actgctacaa caggtggttt gacaaatcct tcactctcatttccttcaag 1440 aatggccagt tgggtctcaa tgcagagcat gcgtgggcag atgctcccatcattgggcac 1500 ctctgggagt ttgtcctggg cacagacagc ttccacctgg gctacacggagaccgggcac 1560 tgcctgggca aaccgaaccc tgcgctcgca cctcctacac ggctgcagtgggacattcca 1620 aaacagtgcc aggcggtcat cgagagttcc taccaggtgg ccaaggcgttggcagacgac 1680 gtggagttgt actgcttcca gttcctgccc tttggcaaag gcctcatcaagaagtgccgg 1740 accagccctg atgcctttgt gcagatcgcg ctgcagctgg ctcacttccgggacaggggt 1800 aagttctgcc tgacctatga ggcctcaatg accagaatgt tccgggagggacggactgag 1860 actgtgcgtt cctgtaccag cgagtccaca gcctttgtgc aggccatgatggaggggtcc 1920 cacacaaaag cagacctgcg agatctcttc cagaaggctg ctaagaagcaccagaatatg 1980 taccgcctgg ccatgaccgg ggcagggatc gacaggcacc tcttctgcctttacttggtc 2040 tccaagtacc taggagtcag ctctcctttc cttgctgagg tgctctcggaaccctggcgt 2100 ctctccacca gccagatccc ccaatcccag atccgcatgt tcgacccagagcagcacccc 2160 aatcacctgg gcgctggagg tggctttggc cctgtagcag atgatggctatggagtttcc 2220 tacatgattg caggcgagaa cacgatcttc ttccacatct ccagcaagttctcaagctca 2280 gagacgaacg cccagcgctt tggaaaccac atccgcaaag ccctgctggacattgctgat 2340 cttttccaag ttcccaaggc ctacagctga agcccttagg tacctgtgttttgtttggga 2400 actcggaggc cctccccctc ccccagctca gaccacagag gtggcaagagaagggctgaa 2460 gctggaagac tgttcatgag ggacttgtgt gacctgcttt gaaatgtgtgactctgctga 2520 gtgacgtagg ctctgagata gctgtccacg cccacgtgtt tgcttggaataaatacttgc 2580 ctcagaacct tc 2592 77 1429 DNA Homo sapiens 77cagcatggct acgaaatgtg ggaattgtgg acccggctac tccacccctc tggaggccat 60gaaaggaccc agggaagaga tcgtctacct gccctgcatt taccgaaaca caggcactga 120ggccccagat tatctggcca ctgtggatgt tgaccccaag tctccccagt attgccaggt 180catccaccgg ctgcccatgc ccaacctgaa ggacgagctg catcactcag gatggaacac 240ctacagcagc tgcttcggtg atagcaccaa gtcgcgcaac aagctggtct tgcccagtct 300catctcctct cgcatctatg tggtggacgt gggctctgag cccgggcccc aaaagctgca 360caaggtcatt gagcccaagg acatccatgc caagtgcgaa ctggcctgtc tccacaccag 420ccactgcctg gccagcgggg aagtgatgat cagctccctg ggggacgtca agggcaatgg 480caaagggggt tttgtgctgc tggatgggga gacgttcgag gtgaagggga catgggagag 540acctgggggt gctgcaccgt tgggctatga cttctggtac cagcctcgac acaatgtcat 600gatcagcact gagtgggcag ctcccaatgt cttacgagat ggctttaacc ccgctgatgt 660ggaggctgga ctgtacggga gccacttata tgtatgggac tggcagcgcc atgagattgt 720gcagaccctg tctctaaaag atgggctgat acccttggag atccgcttcc tgcacaaccc 780aagtgccacc cagggttttg taggctgtgc ctcagctcca aacatccagc gcttctacaa 840aacgagggaa ggtacatggt cagtggagaa ggtgatccag gtgcccccca agaaagtgaa 900gggctggctg ctgccagggg tgccaggcct gatcaccgac atcctgctct ccctggacga 960ccgcttcctc tacttcagca actggctgca tggggacctg aggcagtatg acatctctga 1020cccacagaga ccccgcctca caggacagct cttcctcgga ggcagcattg ttaagggagg 1080ccctgtgcaa gtgctggagg acgaggaact aaagtcccag ccagagcccc tagtggtcaa 1140gggaaaacgg gtggctggag gccctcagat gatccagctc agcctggatg gcaagcgcct 1200ctacatcacc acgtcgctgt acagtgcctg ggaaaagcag ttttaccctg atctcatcag 1260ggaaggctct gtaatgctgc aggttgatgt agacacagta aaaggagggc tgaagttgaa 1320ccccaactgc ctggtggact tcgggaagga gccccttggc ccagccctgg ctcacgagct 1380tcgctaccct gggggcgatt gtagctctga catctggatt tgaaggctc 1429 78 5683 DNAHomo sapiens 78 ccgcccggtg ttgcgctcct tcccagaatc cgctccggcc tttccttcctgccgcgattc 60 ccaactttgc tcaaagtcgc cggactctaa gctgtcggag ggaccgctggacagacctgg 120 gaactgacag agggcctgga gggaaatagg ccaaagaccc acaggatggagctgacctca 180 accgaaagag ggaggggaca gcctctgccc tgggaacttc gactgcccctactgctaagc 240 gtgctggctg ccacactggc acaggcccct gccccggatg tccctggctgttccagggga 300 agctgctacc ccgccacggc cgacctgctg gtgggccgag ctgacagactgactgcctca 360 tccacttgtg gcctgaatgg ccgccagccc tactgcatcg tcagtcacctgcaggacgaa 420 aagaagtgct tcctttgtga ctcccggcgc cccttctctg ctagagacaacccacacacc 480 catcgcatcc agaatgtagt caccagcttt gcaccacagc ggcgggcagcttggtggcag 540 tcacagaatg gtatccctgc ggtcaccatc cagctggacc tggaggctgagtttcatttc 600 acacacctca ttatgacctt caagacattt cgccctgctg ccatgctggtcgaacgctca 660 gcagactttg gccgcacctg gcatgtgtac cgatatttct cctatcactgtggggctgac 720 ttcccaggag tcccactagc acccccacgg cactgggatg atgtagtctgtgagtcccgc 780 tactcagaga ttgagccatc cactgaaggc gaggtcatct atcgtgtgctggaccctgcc 840 atccctatcc cagaccccta cagctcacgg attcagaacc tgttgaagatcaccaaccta 900 cgggtgaacc tgactcgtct acacacgttg ggagacaacc tactcgacccacggagggag 960 atccgagaga agtactacta tgccctctat gagctggttg tacgtggcaactgcttctgc 1020 tacggacacg cctcagagtg tgcacccgcc ccaggggcac cagcccatgctgagggcatg 1080 gtgcacggag cttgcatctg caaacacaac acacgtggcc tcaactgcgagcagtgtcag 1140 gatttctatc gtgacctgcc ctggcgtccg gctgaggacg gccatagtcatgcctgtagg 1200 aagtgtgatc ggcatgggca cacccacagc tgccacttcg acatggccgtatacctcgga 1260 tctggcaatg tgagtggagg tgtgtgtgat ggatgtcagc ataacacagcgtggcgccac 1320 tgtgagctct gtcggccctt cttctaccgt gacccaacca aggacctgcgggatccggct 1380 gtgtgccgct cctgtgattg tgaccccatg ggttctcaag acggtggtcgctgtgattcc 1440 catgatgacc ctgcactggg actggtctcc ggccagtgtc gctgcaaagaacacgtggtg 1500 ggcactcgct gccagcaatg ccgtgatggc ttctttgggc tcagcatcagtgacccgtct 1560 gggtgccggc gatgtcaatg taatgcacgg ggcacagtgc ctgggagcactccttgtgac 1620 cccaacagtg gatcctgtta ctgcaaacgt ctagtgactg gacgtggatgtgaccgctgc 1680 ctgcctggcc actggggcct gagcctcgac ctgctcggct gccgcccctgtgactgcgac 1740 gtgggtggtg ctttggatcc ccagtgtgat gagggcacag gtcaatgccactgccgccag 1800 cacatggttg ggcgacgctg tgagcaggtg caacctggct acttccggcccttcctggac 1860 cacctaattt gggaggctga gaacacccga gggcaggtgc tcgatgtggtggagcgcctg 1920 gtgacccccg gggaaactcc atcctggact ggctcaggct tcgtgcgactacaggaaggt 1980 cagaccctgg agttcctggt ggcctctgtg ccgaacgcga tggactatgacctgctgctg 2040 cgcttagagc cccaggtccc tgagcaatgg gcagagttgg aactgattgtgcagcgtcca 2100 gggcctgtgc ctgcccacag cctgtgtggg catttggtgc ccagggatgatcgcatccaa 2160 gggactctgc aaccacatgc caggtacttg atatttccta atcctgtctgccttgagcct 2220 ggtatctcct acaagctgca tctgaagctg gtacggacag ggggaagtgcccagcctgag 2280 actccctact ctggacctgg cctgctcatt gactcgctgg tgctgctgccccgtgtcctg 2340 gtgctagaga tgtttagtgg gggtgatgct gctgccctgg agcgccaggccacctttgaa 2400 cgctaccaat gccatgagga gggtctggtg cccagcaaga cttctccctctgaggcctgc 2460 gcacccctcc tcatcagcct gtccaccctc atctacaatg gtgccctgccatgtcagtgc 2520 aaccctcaag gttcactgag ttctgagtgc aaccctcatg gtggtcagtgcctgtgcaag 2580 cctggagtgg ttgggcgccg ctgtgacacg tgtgcccctg gctactatggctttggcccc 2640 acaggctgtc aagcctgcca gtgcagccca cgaggggcac tcagcagtctctgtgaaagg 2700 accagtgggc aatgtctctg tcgaactggt gcctttgggc ttcgctgtgacgcctgccag 2760 cgtggccagt ggggattccc tagctgccgg ccatgtgtct gcaatgggcatgcagatgag 2820 tgcaacaccc acacaggcgc ttgcctgggc tgccgtgatc tcacagggggtgagcactgt 2880 gaaaggtgca ttgctggttt ccacggggac ccacggctgc catatggggcgcagtgccgg 2940 ccctgtccct gtcctgaagg ccctgggagc caacggcact ttgctacttcttgccaccag 3000 gatgaatatt cccagcagat tgtgtgccac tgccgggcag gctatacggggctgcgatgt 3060 gaagcttgtg cccctgggca gtttggggac ccatcaaggc caggtggccggtgccaactg 3120 tgtgagtgca gtgggaacat tgacccaatg gatcctgatg cctgtgacccacaccccggg 3180 caatgcctgc gctgtttaca ccacacagag ggtccacact gtgcccactcgaagcctggc 3240 ttccatggcc aggctgcccg gcagagctgt caccgctgca catgcaacctgctgggcaca 3300 aatccgcagc agtgcccatc tcctgaccag tgccactgtg atccaagcagtgggcagtgc 3360 ccatgcctcc ccaatgtcca ggccctagct gtagaccgct gtgcccccaacttctggaac 3420 ctcaccagtg gccatggttg ccagccttgt gcctgcctcc caagcccggaagaaggcccc 3480 acctgcaacg agttcacagg gcagtgccac tgcctgtgcg gctttggagggcggacttgt 3540 tctgagtgcc aagagctcca ctggggagac cctgggttgc agtgccatgcctgtgattgt 3600 gactctcgtg gaatagatac acctcagtgt caccgcttca caggtcactgcacgtgccgc 3660 ccaggggtgt ctggtgtgcg ctgtgaccag tgtgcccgtg gcttctcaggaatctttcct 3720 gcctgccatc cctgccatgc atgcttcggg gattgggacc gagtggtgcaggacttggca 3780 gcccgtacac agcgcctaga gcagcgggcg caggagttgc aacagacgggtgtgctgggt 3840 gcctttgaga gcagcttctg gcacatgcag gagaagctgg gcattgtgcagggcatcgta 3900 ggtgcccgca acacctcagc cgcctccact gcacagcttg tggaggccacagaggagctg 3960 cggcgtgaaa ttggggaggc cactgagcac ctgactcagc tcgaggcagacctgacagat 4020 gtgcaagatg agaacttcaa tgccaaccat gcactaagtg gtctggagcgagataggctt 4080 gcacttaatc tcacactgcg gcagctcgac cagcatcttg acttgctcaaacattcaaac 4140 ttcctgggtg cctatgacag catccggcat gcccatagcc agtctgcagaggcagaacgt 4200 cgtgccaata cctcagccct ggcagtacct agccctgtga gcaactcggcaagtgctcgg 4260 catcggacag aggcactgat ggatgctcag aaggaggact tcaacagcaaacacatggcc 4320 aaccagcggg cacttggcaa gctctctgcc catacccaca ccctgagcctgacagacata 4380 aatgagctgg tgtgtggggc ccagggattg catcatgatc gtacaagcccttgtgggggt 4440 gccggctgtc gagatgagga tgggcagccg cgctgtgggg gcctcagctgcaatggggca 4500 gcggctacag cagacctagc actgggccgg gcccggcaca cacaggcagagctgcagcgg 4560 gcactggcag aaggtggtag catcctcagc agagtggctg agactcgtcggcaggcaagc 4620 gaggcacagc agcgggccca ggcagccctg gacaaggcta atgcttccaggggacaggtg 4680 gaacaggcca accaggaact tcaagaactt atccagagtg tgaaggacttcctcaaccag 4740 gagggggctg atcctgatag cattgaaatg gtggccacac gggtgctagagctctccatc 4800 ccagcttcag ctgagcagat ccagcacctg gcgggcgcga ttgcagagcgagtccggagc 4860 ctggcagatg tggatgcgat cctggcacgt actgtaggag atgtgcgtcgtgccgagcag 4920 ctactgcagg atgcacggcg ggcaaggagc tgggctgagg atgagaaacagaaggcagag 4980 acagtacagg cagcactgga ggaggcccag cgggcacagg gtattgcccagggtgccatc 5040 cggggggcag tggctgacac acgggacaca gagcagaccc tgtaccaggtacaggagagg 5100 atggcaggtg cagagcgggc actgagctct gcaggtgaaa gggctcggcagttggatgct 5160 ctcctggagg ctctgaaatt gaaacgggca ggaaatagtc tggcagcctctacagcagaa 5220 gaaacggcag gcagtgccca gggtcgtgcc caggaggctg agcagctgctacgcggtcct 5280 ctgggtgatc agtaccagac ggtgaaggcc ctagctgagc gcaaggcccaaggtgtgctg 5340 gctgcacagg caagggcaga acaactgccg gatgaggctc gggacctgttgcaagccgct 5400 caggacaagc tgcagcggct acaggaattg gaaggcacct atgaggaaaatgagcgggca 5460 ctggagagta aggcagccca gttggacggg ttggaggcca ggatgcgcagcgtgcttcaa 5520 gccatcaact tgcaggtgca gatctacaac acctgccagt gacccctgcccaaggcctac 5580 cccagttcct agcactgccc cacatgcatg tctgcctatg cactgaagagctcttggccc 5640 ggcagggccc ccaataaacc agtgtgaacc cccaaaaaaa aaa 5683 795177 DNA Homo sapiens 79 ggactgcgaa aggagcaggg ttgcggagct agggctccagcctgcggccg cgcattcttg 60 cgtctggcca gccgcgagct ctaagggtcg gccccgcccggtccgccccc gcggctccct 120 gccaggctct cgcgggcgcg ctcggggtgg ggcctcgcggctggcggaga tgcggccggg 180 gctgcgcggt ggtgatgcga gcctgctggg cggcgcgccggggcagccgg agccgcgcgc 240 cgcggcgctg taatcggaca ccaagagcgc tcgcccccggcctccggcca ctttccattc 300 actccgaggt gcttgattga gcgacgcgga gaagagctccgggtgccgcg gcactgcagc 360 gctgagattc ctttacaaag aaactcagag gaccgggaagaaagaatttc acctttgcga 420 cgtgctagaa aataaggtcg tctgggaaaa ggactggagacacaagcgca tccaaccccg 480 gtagcaaact gatgactttt ccgtgctgat ttctttcaacctcggtattt tcccttggat 540 attaacttgc atatctgaag aaatggcatt ccggacaatttgcgtgttgg ttggagtatt 600 tatttgttct atctgtgtga aaggatcttc ccagccccaagcaagagttt atttaacatt 660 tgatgaactt cgagaaacca agacctctga atacttcagcctttcccacc atcctttaga 720 ctacaggatt ttattaatgg atgaagatca ggaccggatatatgtgggaa gcaaagatca 780 cattctttcc ctgaatatta acaatataag tcaagaagctttgagtgttt tctggccagc 840 atctacaatc aaagttgaag aatgcaaaat ggctggcaaagatcccacac acggctgtgg 900 gaactttgtc cgtgtaattc agactttcaa tcgcacacatttgtatgtct gtgggagtgg 960 cgctttcagt cctgtctgta cttacttgaa cagagggaggagatcagagg accaagtttt 1020 catgattgac tccaagtgtg aatctggaaa aggacgctgctctttcaacc ccaacgtgaa 1080 cacggtgtct gttatgatca atgaggagct tttctctggaatgtatatag atttcatggg 1140 gacagatgct gctatttttc gaagtttaac caagaggaatgcggtcagaa ctgatcaaca 1200 taattccaaa tggctaagtg aacctatgtt tgtagatgcacatgtcatcc cagatggtac 1260 tgatccaaat gatgctaagg tgtacttctt cttcaaagaaaaactgactg acaataacag 1320 gagcacgaaa cagattcatt ccatgattgc tcgaatatgtcctaatgaca ctggtggact 1380 gcgtagcctt gtcaacaagt ggaccacttt cttaaaggcgaggctggtgt gctcggtaac 1440 agatgaagac ggcccagaaa cacactttga tgaattagaggatgtgtttc tgctggaaac 1500 tgataacccg aggacaacac tagtgtatgg catttttacaacatcaagct cagttttcaa 1560 aggatcagcc gtgtgtgtgt atcatttatc tgatatacagactgtgttta atgggccttt 1620 tgcccacaaa gaagggccca atcatcagct gatttcctatcagggcagaa ttccatatcc 1680 tcgccctgga acttgtccag gaggagcatt tacacccaatatgcgaacca ccaaggagtt 1740 cccagatgat gttgtcactt ttattcggaa ccatcctctcatgtacaatt ccatctaccc 1800 aatccacaaa aggcctttga ttgttcgtat tggcactgactacaagtaca caaagatagc 1860 tgtggatcga gtgaacgctg ctgatgggag ataccatgtcctgtttctcg gaacagatcg 1920 gggtactgtg caaaaagtgg ttgttcttcc tactaacaactctgtcagtg gcgagctcat 1980 tctggaggag ctggaagtct ttaagaatca tgctcctataacaacaatga aaatttcatc 2040 taaaaagcaa cagttgtatg tgagttccaa tgaaggggtttcccaagtat ctctgcaccg 2100 ctgccacatc tatggtacag cctgtgctga ctgctgcctggcgcgggacc cttattgcgc 2160 ctgggatggc cattcctgtt ccagattcta cccaactgggaaacggagga gccgaagaca 2220 agatgtgaga catggaaacc cactgactca atgcagaggatttaatctaa aagcatacag 2280 aaatgcagct gaaattgtgc agtatggagt aaaaaataacaccacttttc tggagtgtgc 2340 ccccaagtct ccgcaggcat ctatcaagtg gctgttacagaaagacaaag acaggaggaa 2400 agaggttaag ctgaatgaac gaataatagc cacttcacagggactcctga tccgctctgt 2460 tcagggttct gaccaaggac tttatcactg cattgctacagaaaatagtt tcaagcagac 2520 catagccaag atcaacttca aagttttaga ttcagaaatggtggctgttg tgacggacaa 2580 atggtccccg tggacctggg ccagctctgt gagggctttacccttccacc cgaaggacat 2640 catgggggca ttcagccact cagaaatgca gatgattaaccaatactgca aagacactcg 2700 gcagcaacat cagcagggag atgaatcaca gaaaatgagaggggactatg gcaagttaaa 2760 ggccctcatc aatagtcgga aaagtagaaa caggaggaatcagttgccag agtcataata 2820 ttttcttatg tgggtcttat gcttccatta acaaatgctctgtcttcaat gatcaaattt 2880 tgagcaaaga aacttgtgct ttaccaaggg gaattactgaaaaaggtgat tactcctgaa 2940 gtgagtttta cacgaactga aatgagcatg cattttcttgtatgatagtg actagcacta 3000 gacatgtcat ggtcctcatg gtgcatataa atatatttaacttaacccag attttattta 3060 tatctttatt caccttttct tcaaaatcga tatggtggctgcaaaactag aattgttgca 3120 tccctcaatt gaatgagggc catatccctg tggtattcctttcctgcttt ggggctttag 3180 aattctaatt gtcagtgatt ttgtatatga aaacaagttccaaatccaca gcttttacgt 3240 agtaaaagtc ataaatgcat atgacagaat ggctatcaaaagaaatagaa aaggaagacg 3300 gcatttaaag ttgtataaaa acacgagtta ttcataaagagaaaatgatg agtttttatg 3360 gttccaatga aatatcttcc ccttttttta agattgtaaaaataatcagt tactggtatc 3420 tgtcactgac ctttgtttcc ttattcagga agataaaaatcagtaaccta ccccatgaag 3480 atatttggtg ggagttatat cagtgaagca gtttggtttatattcttatg ttatcacctt 3540 ccaaacaaaa gcacttactt tttttggaag ttatttaatttattttagac tcaaagaata 3600 taatcttgca ctactcagtt attactgttt gttctcttattccctagtct gtgtggcaaa 3660 ttaaacaata taagaaggaa aaatttgaag tattagacttctaaataagg ggtgaaatca 3720 tcagaaagaa aaatcaaagt agaaactact aattttttaagaggaattta taacaaatat 3780 ggctagtttt caacttcagt actcaaattc aatgattcttccttttatta aaaccagtct 3840 cagatatcat actgattttt aagtcaacac tatatattttatgatctttt cagtgtgatg 3900 gcaaggtgct tgttatgtct agaaagtaag aaaacaatatgaggagacat tctgtctttc 3960 aaaaggtaat ggtacatacg ttcactggtc tctaagtgtaaaagtagtaa attttgtgat 4020 gaataaaata attatctcct aattgtatgt tagaataattttattagaat aatttcatac 4080 tgaaattatt ttctccaaat aaaaattaga tggaaaaatgtgaaaaaaat tattcatgct 4140 ctcatatata ttttaaaaac actacttttg cttttttatttaccttttaa gacattttca 4200 tgcttccagg taaaaacaga tattgtacca tgtacctaatccaaatatca tataaacatt 4260 ttatttatag ttaataatct atgatgaagg taattaaagtagattatggc ctttttaagt 4320 attgcagtct aaaacttcaa aaactaaaat cattgtcaaaattaatatga ttattaatca 4380 gaatatcaga tatgattcac tatttaaact atgataaattatgataatat atgaggaggc 4440 ctcgctatag caaaaatagt taaaatgctg acataacaccaaacttcatt ttttaaaaaa 4500 tctgttgttc caaatgtgta taattttaaa gtaatttctaaagcagttta ttataatggt 4560 ttgcctgctt aaaaggtata attaaacttc ttttctcttctacattgaca cacagaaatg 4620 tgtcaatgta aagccaaaac catcttctgt gtttatggccaatctattct caaagttaaa 4680 agtaaaattg tttcagagtc acagttccct ttatttcacataagcccaaa ctgatagaca 4740 gtaacggtgt ttagttttat actatatttg tgctatttaattctttctat tttcacaatt 4800 attaaattgt gtacactttc attactttta aaaatgtagaaattcttcat gaacataact 4860 ctgctgaatg taaaagaaaa ttttttttca aaaatgctgttaatgtatac tactggtggt 4920 tgattggttt tattttatgt agcttgacaa ttcagtgacttaatatctat tccatttgta 4980 ttgtacataa aattttctag aaatacactt ttttccaaagtgtaagtttg tgaatagatt 5040 ttagcatgat gaaactgtca taatggtgaa tgttcaatctgtgtaagaaa acaaactaaa 5100 tgtagttgtc acactaaaat ttaattggat attgatgaaatcattggcct ggcaaaataa 5160 aacatgttga attcccc 5177 80 9164 DNA Homosapiens 80 ggctggaggg gcgctgggct cggacctgcc aaggccacgg gggagcaagggacagaggcg 60 ggggtcctag ctgacggctt ttactgccta ggatgacgct gcggcttctggtggccgcgc 120 tctgcgccgg gatcctggca gaggcgcccc gagtgcgagc ccagcacagggagagagtga 180 cctgcacgcg cctttacgcc gctgacattg tgttcttact ggatggctcctcatccattg 240 gccgcagcaa tttccgcgag gtccgcagct ttctcgaagg gctggtgctgcctttctctg 300 gagcagccag tgcacagggt gtgcgctttg ccacagtgca gtacagcgatgacccacgga 360 cagagttcgg cctggatgca cttggctctg ggggtgatgt gatccgcgccatccgtgagc 420 ttagctacaa ggggggcaac actcgcacag gggctgcaat tctccatgtggctgaccatg 480 tcttcctgcc ccagctggcc cgacctggtg tccccaaggt ctgcatcctgatcacagacg 540 ggaagtccca ggacctggtg gacacagctg cccaaaggct gaaggggcagggggtcaagc 600 tatttgctgt ggggatcaag aatgctgacc ctgaggagct gaagcgagttgcctcacagc 660 ccaccagtga cttcttcttc ttcgtcaatg acttcagcat cttgaggacactactgcccc 720 tcgtttcccg gagagtgtgc acgactgctg gtggcgtgcc tgtgacccgacctccggatg 780 actcgacctc tgctccacga gacctggtgc tgtctgagcc aagcagccaatccttgagag 840 tacagtggac agcggccagt ggccctgtga ctggctacaa ggtccagtacactcctctga 900 cggggctggg acagccactg ccgagtgagc ggcaggaggt gaacgtcccagctggtgaga 960 ccagtgtgcg gctgcggggt ctccggccac tgaccgagta ccaagtgactgtgattgccc 1020 tctacgccaa cagcatcggg gaggctgtga gcgggacagc tcggaccactgccctagaag 1080 ggccggaact gaccatccag aataccacag cccacagcct cctggtggcctggcggagtg 1140 tgccaggtgc cactggctac cgtgtgacat ggcgggtcct cagtggtgggcccacacagc 1200 agcaggagct gggccctggg cagggttcag tgttgctgcg tgacttggagcctggcacgg 1260 actatgaggt gaccgtgagc accctatttg gccgcagtgt ggggcccgccacttccctga 1320 tggctcgcac tgacgcttct gttgagcaga ccctgcgccc ggtcatcctgggccccacat 1380 ccatcctcct ttcctggaac ttggtgcctg aggcccgtgg ctaccggttggaatggcggc 1440 gtgagactgg cttggagcca ccgcagaagg tggtactgcc ctctgatgtgacccgctacc 1500 agttggatgg gctgcagccg ggcactgagt accgcctcac actctacactctgctggagg 1560 gccacgaggt ggccacccct gcaaccgtgg ttcccactgg accagagctgcctgtgagcc 1620 ctgtaacaga cctgcaagcc accgagctgc ccgggcagcg ggtgcgagtgtcctggagcc 1680 cagtccctgg tgccacccag taccgcatca ttgtgcgcag cacccagggggtggagcgga 1740 ccctggtgct tcctgggagt cagacagcat tcgacttgga tgacgttcaggctgggctta 1800 gctacactgt gcgggtgtct gctcgagtgg gtccccgtga gggcagtgccagtgtcctca 1860 ctgtccgccg ggagctggaa actccacttg ctgttccagg gctgcgggttgtggtgtcag 1920 atgcaacgcg agtgagggtg gcctggggac ccgtccctgg agccagtggatttcggatta 1980 gctggagcac aggcagtggt ccggagtcca gccagacact gcccccagactctactgcca 2040 cagacatcac agggctgcag cctggaacca cctaccaggt ggctgtgtcggtactgcgag 2100 gcagagagga gggccctgct gcagtcatcg tggctcgaac ggacccactgggcccagtga 2160 ggacggtcca tgtgactcag gccagcagct catctgtcac cattacctggaccagggttc 2220 ctggcgccac aggatacagg gtttcctggc actcagccca cggcccagagaaatcccagt 2280 tggtttctgg ggaggccacg gtggctgagc tggatggact ggagccagatactgagtata 2340 cggtgcatgt gagggcccat gtggctggcg tggatgggcc ccctgcctctgtggttgtga 2400 ggactgcccc tgagcctgtg ggtcgtgtgt cgaggctgca gatcctcaatgcttccagcg 2460 acgttctacg gatcacctgg gtaggggtca ctggagccac agcttacagactggcctggg 2520 gccggagtga aggcggcccc atgaggcacc agatactccc aggaaacacagactctgcag 2580 agatccgggg tctcgaaggt ggagtcagct actcagtgcg agtgactgcacttgtcgggg 2640 accgcgaggg cacacctgtc tccattgttg tcactacgcc gcctgaggctccgccagccc 2700 tggggacgct tcacgtggtg cagcgcgggg agcactcgct gaggctgcgctgggagccgg 2760 tgcccagaga gcagggcttc cttctgcact ggcaacctga gggtggccaggaacagtccc 2820 gggtcctggg gcccgagctc agcagctatc acctggacgg gctggagccagcgacacagt 2880 accgcgtgag gctgagtgtc ctagggccag ctggagaagg gccctctgcagaggtgactg 2940 cgcgcactga gtcacctcgt gttccaagca ttgaactacg tgtggtggacacctcgatcg 3000 actcggtgac tttggcctgg actccagtgt ccagggcatc cagctacatcctatcctggc 3060 ggccactcag aggccctggc caggaagtgc ctgggtcccc gcagacacttccagggatct 3120 caagctccca gcgggtgaca gggctagagc ctggcgtctc ttacatcttctccctgacgc 3180 ctgtcctgga tggtgtgcgg ggtcctgagg catctgtcac acagacgccagtgtgccccc 3240 gtggcctggc ggatgtggtg ttcctaccac atgccactca agacaatgctcaccgtgcgg 3300 aggctacgag gagggtcctg gagcgtctgg tgttggcact tgggcctcttgggccacagg 3360 cagttcaggt tggcctgctg tcttacagtc atcggccttc cccactgttcccactgaatg 3420 gctcccatga ccttggcatt atcttgcaaa ggatccgtga catgccctacatggacccaa 3480 gtgggaacaa cctgggcaca gccgtggtca cagctcacag atacatgttggcaccagatg 3540 ctcctgggcg ccgccagcac gtaccagggg tgatggttct gctagtggatgaacccttga 3600 gaggtgacat attcagcccc atccgtgagg cccaggcttc tgggcttaatgtggtgatgt 3660 tgggaatggc tggagcggac ccagagcagc tgcgtcgctt ggcgccgggtatggactctg 3720 tccagacctt cttcgccgtg gatgatgggc caagcctgga ccaggcagtcagtggtctgg 3780 ccacagccct gtgtcaggca tccttcacta ctcagccccg gccagagccctgcccagtgt 3840 attgtccaaa gggccagaag ggggaacctg gagagatggg cctgagaggacaagttgggc 3900 ctcctggcga ccctggcctc ccgggcagga ccggtgctcc cggcccccaggggccccctg 3960 gaagtgccac tgccaagggc gagaggggct tccctggagc agatgggcgtccaggcagcc 4020 ctggccgcgc cgggaatcct gggacccctg gagcccctgg cctaaagggctctccagggt 4080 tgcctggccc tcgtggggac ccgggagagc gaggacctcg aggcccaaagggggagccgg 4140 gggctcccgg acaagtcatc ggaggtgaag gacctgggct tcctgggcggaaaggggacc 4200 ctggaccatc gggcccccct ggacctcgtg gaccactggg ggacccaggaccccgtggcc 4260 ccccagggct tcctggaaca gccatgaagg gtgacaaagg cgatcgtggggagcggggtc 4320 cccctggacc aggtgaaggt ggcattgctc ctggggagcc tgggctgccgggtcttcccg 4380 gaagccctgg accccaaggc cccgttggcc cccctggaaa gaaaggagaaaaaggtgact 4440 ctgaggatgg agctccaggc ctcccaggac aacctgggtc tccgggtgagcagggcccac 4500 ggggacctcc tggagctatt ggccccaaag gtgaccgggg ctttccagggcccctgggtg 4560 aggctggaga gaagggcgaa cgtggacccc caggcccagc gggatcccgggggctgccag 4620 gggttgctgg acgtcctgga gccaagggtc ctgaagggcc accaggacccactggccgcc 4680 aaggagagaa gggggagcct ggtcgccctg gggaccctgc agtggtgggacctgctgttg 4740 ctggacccaa aggagaaaag ggagatgtgg ggcccgctgg gcccagaggagctaccggag 4800 tccaagggga acggggccca cccggcttgg ttcttcctgg agaccctggccccaagggag 4860 accctggaga ccggggtccc attggcctta ctggcagagc aggacccccaggtgactcag 4920 ggcctcctgg agagaaggga gaccctgggc ggcctggccc cccaggacctgttggccccc 4980 gaggacgaga tggtgaagtt ggagagaaag gtgacgaggg tcctccgggtgacccgggtt 5040 tgcctggaaa agcaggcgag cgtggccttc ggggggcacc tggagttcgggggcctgtgg 5100 gtgaaaaggg agaccaggga gatcctggag aggatggacg aaatggcagccctggatcat 5160 ctggacccaa gggtgaccgt ggggagccgg gtcccccagg acccccgggacggctggtag 5220 acacaggacc tggagccaga gagaagggag agcctgggga ccgcggacaagagggtcctc 5280 gagggcccaa gggtgatcct ggcctccctg gagcccctgg ggaaaggggcattgaagggt 5340 ttcggggacc cccaggccca cagggggacc caggtgtccg aggcccagcaggagaaaagg 5400 gtgaccgggg tccccctggg ctggatggcc ggagcggact ggatgggaaaccaggagccg 5460 ctgggccctc tgggccgaat ggtgctgcag gcaaagctgg ggacccagggagagacgggc 5520 ttccaggcct ccgtggagaa caaggcctcc ctggcccctc tggtccccctggattaccgg 5580 gaaagccagg cgaggatggg aaacctggcc tgaatggaaa aaacggagaacctggggacc 5640 ctggagaaga cgggaggaag ggagagaaag gagattcagg cgcctctgggagagaaggtt 5700 ttcctggtgt cccaggaggc acgggcccca agggtgaccg tggggagactggatccaaag 5760 gggagcaggg cctccctgga gagcgtggcc tgcgaggaga gcctggaagtgtgccgaatg 5820 tggatcggtt gctggaaact gctggcatca aggcatctgc cctgcgggagatcgtggaga 5880 cctgggatga gagctctggt agcttcctgc ctgtgcccga acggcgtcgaggccccaagg 5940 gggactcagg cgaacagggc cccccaggca aggagggccc catcggctttcctggagaac 6000 gcgggctgaa gggcgaccgt ggagaccctg gccctcaggg gccacctggtctggcccttg 6060 gggagagggg cccccccggg ccttccggcc ttgccgggga gcctggaaagcctggtattc 6120 ccgggctccc aggcagggct gggggtgtgg gagaggcagg aaggccaggagagaggggag 6180 aacggggaga gaaaggagaa cgtggagaac agggcagaga tggccctcctggactccctg 6240 gaacccctgg gccccccgga ccccctggcc ccaaggtgtc tgtggatgagccaggtcctg 6300 gactctctgg agaacaggga ccccctggac tcaagggtgc taagggggagccgggcagca 6360 atggtgacca aggtcccaaa ggagacaggg gtgtgccagg catcaaaggagaccggggag 6420 agcctggacc gaggggtcag gacggcaacc cgggtctacc aggagagcgtggtatggctg 6480 ggcctgaagg gaagccgggt ctgcagggtc caagaggccc ccctggcccagtgggtggtc 6540 atggagaccc tggaccacct ggtgccccgg gtcttgctgg ccctgcaggaccccaaggac 6600 cttctggcct gaagggggag cctggagaga caggacctcc aggacggggcctgactggac 6660 ctactggagc tgtgggactt cctggacccc ccggcccttc aggccttgtgggtccacagg 6720 ggtctccagg tttgcctgga caagtggggg agacagggaa gccgggagccccaggtcgag 6780 atggtgccag tggaaaagat ggagacagag ggagccctgg tgtgccagggtcaccaggtc 6840 tgcctggccc tgtcggacct aaaggagaac ctggccccac gggggcccctggacaggctg 6900 tggtcgggct ccctggagca aagggagaga agggagcccc tggaggccttgctggagacc 6960 tggtgggtga gccgggagcc aaaggtgacc gaggactgcc agggccgcgaggcgagaagg 7020 gtgaagctgg ccgtgcaggg gagcccggag accctgggga agatggtcagaaaggggctc 7080 caggacccaa aggtttcaag ggtgacccag gagtcggggt cccgggctcccctgggcctc 7140 ctggccctcc aggtgtgaag ggagatctgg gcctccctgg cctgcccggtgctcctggtg 7200 ttgttgggtt cccgggtcag acaggccctc gaggagagat gggtcagccaggccctagtg 7260 gagagcgggg tctggcaggc cccccaggga gagaaggaat cccaggacccctggggccac 7320 ctggaccacc ggggtcagtg ggaccacctg gggcctctgg actcaaaggagacaagggag 7380 accctggagt agggctgcct gggccccgag gcgagcgtgg ggagccaggcatccggggtg 7440 aagatggccg ccccggccag gagggacccc gaggactcac ggggccccctggcagcaggg 7500 gagagcgtgg ggagaagggt gatgttggga gtgcaggact aaagggtgacaagggagact 7560 cagctgtgat cctggggcct ccaggcccac ggggtgccaa gggggacatgggtgaacgag 7620 ggcctcgggg cttggatggt gacaaaggac ctcggggaga caatggggaccctggtgaca 7680 agggcagcaa gggagagcct ggtgacaagg gctcagccgg gttgccaggactgcgtggac 7740 tcctgggacc ccagggtcaa cctggtgcag cagggatccc tggtgacccgggatccccag 7800 gaaaggatgg agtgcctggt atccgaggag aaaaaggaga tgttggcttcatgggtcccc 7860 ggggcctcaa gggtgaacgg ggagtgaagg gagcctgtgg ccttgatggagagaagggag 7920 acaagggaga agctggtccc ccaggccgcc ccgggctggc aggacacaaaggagagatgg 7980 gggagcctgg tgtgccgggc cagtcggggg cccctggcaa ggagggcctgatcggtccca 8040 agggtgaccg aggctttgac gggcagccag gccccaaggg tgaccagggcgagaaagggg 8100 agcggggaac cccaggaatt gggggcttcc caggccccag tggaaatgatggctctgctg 8160 gtcccccagg gccacctggc agtgttggtc ccagaggccc cgaaggacttcagggccaga 8220 agggtgagcg aggtcccccc ggagagagag tggtgggggc tcctggggtccctggagctc 8280 ctggcgagag aggggagcag gggcggccag ggcctgccgg tcctcgaggcgagaagggag 8340 aagctgcact gacggaggat gacatccggg gctttgtgcg ccaagagatgagtcagcact 8400 gtgcctgcca gggccagttc atcgcatctg gatcacgacc cctccctagttatgctgcag 8460 acactgccgg ctcccagctc catgctgtgc ctgtgctccg cgtctctcatgcagaggagg 8520 aagagcgggt accccctgag gatgatgagt actctgaata ctccgagtattctgtggagg 8580 agtaccagga ccctgaagct ccttgggata gtgatgaccc ctgttccctgccactggatg 8640 agggctcctg cactgcctac accctgcgct ggtaccatcg ggctgtgacaggcagcacag 8700 aggcctgtca cccttttgtc tatggtggct gtggagggaa tgccaaccgttttgggaccc 8760 gtgaggcctg cgagcgccgc tgcccacccc gggtggtcca gagccaggggacaggtactg 8820 cccaggactg aggcccagat aatgagctga gattcagcat cccctggaggagtcggggtc 8880 tcagcagaac cccactgtcc ctccccttgg tgctagaggc ttgtgtgcacgtgagcgtgc 8940 gagtgcacgt ccgttatttc agtgacttgg tcccgtgggt ctagccttcccccctgtgga 9000 caaaccccca ttgtggctcc tgccaccctg gcagatgact cactgtgggggggtggctgt 9060 gggcagtgag cggatgtgac tggcgtctga cccgcccctt gacccaagcctgtgatgaca 9120 tggtgctgat tctggggggc attaaagctg ctgttttaaa aggc 9164 812148 DNA Homo sapiens 81 gcttcagggt acagctcccc cgcagccaga agccgggcctgcagcccctc agcaccgctc 60 cgggacaccc cacccgcttc ccaggcgtga cctgtcaacagcaacttcgc ggtgtggtga 120 actctctgag gaaaaaccat tttgattatt actctcagacgtgcgtggca acaagtgact 180 gagacctaga aatccaagcg ttggaggtcc tgaggccagcctaagtcgct tcaaaatgga 240 acgaaggcgt ttgtggggtt ccattcagag ccgatacatcagcatgagtg tgtggacaag 300 cccacggaga cttgtggagc tggcagggca gagcctgctgaaggatgagg ccctggccat 360 tgccgccctg gagttgctgc ccagggagct cttcccgccactcttcatgg cagcctttga 420 cgggagacac agccagaccc tgaaggcaat ggtgcaggcctggcccttca cctgcctccc 480 tctgggagtg ctgatgaagg gacaacatct tcacctggagaccttcaaag ctgtgcttga 540 tggacttgat gtgctccttg cccaggaggt tcgccccaggaggtggaaac ttcaagtgct 600 ggatttacgg aagaactctc atcaggactt ctggactgtatggtctggaa acagggccag 660 tctgtactca tttccagagc cagaagcagc tcagcccatgacaaagaagc gaaaagtaga 720 tggtttgagc acagaggcag agcagccctt cattccagtagaggtgctcg tagacctgtt 780 cctcaaggaa ggtgcctgtg atgaattgtt ctcctacctcattgagaaag tgaagcgaaa 840 gaaaaatgta ctacgcctgt gctgtaagaa gctgaagatttttgcaatgc ccatgcagga 900 tatcaagatg atcctgaaaa tggtgcagct ggactctattgaagatttgg aagtgacttg 960 tacctggaag ctacccacct tggcgaaatt ttctccttacctgggccaga tgattaatct 1020 gcgtagactc ctcctctccc acatccatgc atcttcctacatttccccgg agaaggaaga 1080 gcagtatatc gcccagttca cctctcagtt cctcagtctgcagtgcctgc aggctctcta 1140 tgtggactct ttatttttcc ttagaggccg cctggatcagttgctcaggc acgtgatgaa 1200 ccccttggaa accctctcaa taactaactg ccggctttcggaaggggatg tgatgcatct 1260 gtcccagagt cccagcgtca gtcagctaag tgtcctgagtctaagtgggg tcatgctgac 1320 cgatgtaagt cccgagcccc tccaagctct gctggagagagcctctgcca ccctccagga 1380 cctggtcttt gatgagtgtg ggatcacgga tgatcagctccttgccctcc tgccttccct 1440 gagccactgc tcccagctta caaccttaag cttctacgggaattccatct ccatatctgc 1500 cttgcagagt ctcctgcagc acctcatcgg gctgagcaatctgacccacg tgctgtatcc 1560 tgtccccctg gagagttatg aggacatcca tggtaccctccacctggaga ggcttgccta 1620 tctgcatgcc aggctcaggg agttgctgtg tgagttggggcggcccagca tggtctggct 1680 tagtgccaac ccctgtcctc actgtgggga cagaaccttctatgacccgg agcccatcct 1740 gtgcccctgt ttcatgccta actagctggg tgcacatatcaaatgcttca ttctgcatac 1800 ttggacacta aagccaggat gtgcatgcat cttgaagcaacaaagcagcc acagtttcag 1860 acaaatgttc agtgtgagtg aggaaaacat gttcagtgaggaaaaaacat tcagacaaat 1920 gttcagtgag gaaaaaaagg ggaagttggg gataggcagatgttgacttg aggagttaat 1980 gtgatctttg gggagataca tcttatagag ttagaaatagaatctgaatt tctaaaggga 2040 gattctggct tgggaagtac atgtaggagt taatccctgtgtagactgtt gtaaagaaac 2100 tgttgaaaat aaagagaagc aatgtgaagc aaaaaaaaaaaaaaaaaa 2148 82 3370 DNA Homo sapiens 82 gcccccgccc ggcccgccccgctctcctag tcccttgcaa cctggcgctg catccgggcc 60 actgtcccag gtcccaggtcccggcccgga gctatggagc ggcgctggcc cctggggcta 120 gggctggtgc tgctgctctgcgccccgctg cccccggggg cgcgcgccaa ggaagttact 180 ctgatggaca caagcaaggcacagggagag ctgggctggc tgctggatcc cccaaaagat 240 gggtggagtg aacagcaacagatactgaat gggacacccc tctacatgta ccaggactgc 300 ccaatgcaag gacgcagagacactgaccac tggcttcgct ccaattggat ctaccgcggg 360 gaggaggctt cccgcgtccacgtggagctg cagttcaccg tgcgggactg caagagtttc 420 cctgggggag ccgggcctctgggctgcaag gagaccttca accttctgta catggagagt 480 gaccaggatg tgggcattcagctccgacgg cccttgttcc agaaggtaac cacggtggct 540 gcagaccaga gcttcaccattcgagacctt gcgtctggct ccgtgaagct gaatgtggag 600 cgctgctctc tgggccgcctgacccgccgt ggcctctacc tcgctttcca caacccgggt 660 gcctgtgtgg ccctggtgtctgtccgggtc ttctaccagc gctgtcctga gaccctgaat 720 ggcttggccc aattcccagacactctgcct ggccccgctg ggttggtgga agtggcgggc 780 acctgcttgc cccacgcgcgggccagcccc aggccctcag gtgcaccccg catgcactgc 840 agccctgatg gcgagtggctggtgcctgta ggacggtgcc actgtgagcc tggctatgag 900 gaaggtggca gtggcgaagcatgtgttgcc tgccctagcg gctcctaccg gatggacatg 960 gacacacccc attgtctcacgtgcccccag cagagcactg ctgagtctga gggggccacc 1020 atctgtacct gtgagagcggccattacaga gctcccgggg agggccccca ggtggcatgc 1080 acaggtcccc cctcggccccccgaaacctg agcttctctg cctcagggac tcagctctcc 1140 ctgcgttggg aacccccagcagatacgggg ggacgccagg atgtcagata cagtgtgagg 1200 tgttcccagt gtcagggcacagcacaggac ggggggccct gccagccctg tggggtgggc 1260 gtgcacttct cgccgggggcccgggcgctc accacacctg cagtgcatgt caatggcctt 1320 gaaccttatg ccaactacacctttaatgtg gaagcccaaa atggagtgtc agggctgggc 1380 agctctggcc atgccagcacctcagtcagc atcagcatgg ggcatgcaga gtcactgtca 1440 ggcctgtctc tgagactggtgaagaaagaa ccgaggcaac tagagctgac ctgggcgggg 1500 tcccggcccc gaagccctggggcgaacctg acctatgagc tgcacgtgct gaaccaggat 1560 gaagaacggt accagatggttctagaaccc agggtcttgc tgacagagct gcagcctgac 1620 accacataca tcgtcagagtccgaatgctg accccactgg gtcctggccc tttctcccct 1680 gatcatgagt ttcggaccagcccaccagtg tccaggggcc tgactggagg agagattgta 1740 gccgtcatct ttgggctgctgcttggtgca gccttgctgc ttgggattct cgttttccgg 1800 tccaggagag cccagcggcagaggcagcag aggcacgtga ccgcgccacc gatgtggatc 1860 gagaggacaa gctgtgctgaagccttatgt ggtacctcca ggcatacgag gaccctgcac 1920 agggagcctt ggactttacccggaggctgg tctaattttc cttcccggga gcttgatcca 1980 gcgtggctga tggtggacactgtcatagga gaaggagagt ttggggaagt gtatcgaggg 2040 accctcaggc tccccagccaggactgcaag actgtggcca ttaagacctt aaaagacaca 2100 tccccaggtg gccagtggtggaacttcctt cgagaggcaa ctatcatggg ccagtttagc 2160 cacccgcata ttctgcatctggaaggcgtc gtcacaaagc gaaagccgat catgatcatc 2220 acagaattta tggagaatgcagccctggat gccttcctga gggagcggga ggaccagctg 2280 gtccctgggc agctagtggccatgctgcag ggcatagcat ctggcatgaa ctacctcagt 2340 aatcacaatt atgtccaccgggacctggct gccagaaaca tcttggtgaa tcaaaacctg 2400 tgctgcaagg tgtctgactttggcctgact cgcctcctgg atgactttga tggcacatac 2460 gaaacccagg gaggaaagatccctatccgt tggacagccc ctgaagccat tgcccatcgg 2520 atcttcacca cagccagcgatgtgtggagc tttgggattg tgatgtggga ggtgctgagc 2580 tttggggaca agccttatggggagatgagc aatcaggagg ttatgaagag cattgaggat 2640 gggtaccggt tgccccctcctgtggactgc cctgcccctc tgtatgagct catgaagaac 2700 tgctgggcat atgaccgtgcccgccggcca cacttccaga agcttcaggc acatctggag 2760 caactgcttg ccaacccccactccctgcgg accattgcca actttgaccc cagggtgact 2820 cttcgcctgc ccagcctgagtggctcagat gggatcccgt atcgaaccgt ctctgagtgg 2880 ctcgagtcca tacgcatgaaacgctacatc ctgcacttcc actcggctgg gctggacacc 2940 atggagtgtg tgctggagctgaccgctgag gacctgacgc agatgggaat cacactgccc 3000 gggcaccaga agcgcattctttgcagtatt cagggattca aggactgatc cctcctctca 3060 ccccatgccc aatcagggtgcaaggagcaa ggacggggcc aaggtcgctc atggtcactc 3120 cctgcgcccc ttcccacaacctgccagact aggctatcgg tgctgcttct gcccgcttta 3180 aggagaaccc tgctctgcaccccagaaaac ctctttgttt taaaagggag gtgggggtag 3240 aagtaaaagg atgatcatgggagggagctc aggggttaat atatatacat acatacacat 3300 atatatattg ttgtaaataaacaggaaatg attttctgcc tccatcccac ccatcagggc 3360 tgcaggcact 3370 8313863 DNA Homo sapiens misc_feature (1)..(13863) n = a, c, g or t 83aagcttagga agcacaagag gctgagcctt tcaggtcagc aaagacttcc cagaggaggc 60agtgcctaca ctgaggtcag agtgacaaga agagtaatgg accactgtaa agacttgggt 120tcggccgggc gcggtggctc acgcctgtaa tcccagcact ttgggaggcc gaggcgggtg 180gatcatgagg tcaggagatc gagaccatcc tggctaacaa ggtgaaaccc cgtctctact 240aaaaatacag aaaattagcc gggcgcggtg gcgggcgcct gtggtcccag ctactcggga 300ggctgaggca ggagaatggc gtgaacccgg gaagcggagc ttgcagtgag ccgagattgc 360gccactgcag tccgcagtcc ggcctgggcg acagagcgag actccgtctc aaaaaaaaaa 420aaagacttgg gtttgacttg attgagccca ggagttcgag acaagcctgg gcaatatagt 480gagacctcat ctctacaaaa attttaaaaa ttagcctggt gcggtggctc atgcctgtaa 540tcccagcact ctgggaggcc gaggtgggcg gatcacttga ggtcagaagt ttgagaccac 600cctgaccaac atggagaaac cccgtctcta ctaaaaatac aaaattagcc gggcatggtg 660gcgcatgcct gtaatcccag ctactcggga ggctgaggca ggagaattgt ttgaacctgg 720gaggtggacg ttgcggtgag ccaagatcac actattgcac tccagcctgg gcaacaagag 780caaaactccg tctcaaaaaa aaaaatttat ttttaaatta gccaggtgta gccacagctg 840tagtcaaatc tactaggcag gctgaggtgg gaggattgct tgaacctggg aggcagaggt 900tgcagtgagc caagatggtg ccacggcatt ccagcctgag caacagcaag accctgtgtc 960caaaaaaaaa aaaaaaaaaa accgtaaaat aggccaggca cagtggttca tggttataag 1020cctagcactt tggaaggctg aggagggtgg atcgcctgag ctcaggagtt caagaccagc 1080ctgggcaaca cggtgaaacc ccatctctac caaaaaaaaa aaaaaaaaaa attagccagg 1140catggtggtg tgtgcctgtg gtcccagcta ctcaggaggc tgaggtggaa gagtgcttgt 1200gcctgggagg cagaggttcc agtgaaccga gatcacacca ttgtactcca gcctgggcaa 1260cagagtaaga ccccatctca aaaaaaaaaa aaaaaattaa gataaaccct ttggcagctg 1320cgtgctgctc ttagcctcaa acccaagtct tttttttccc cctttgagac ggggtctatt 1380gcccaggctg gagtgcaatg gtatgatcca tactcactgc agccccgaac tcctgggctt 1440ccaaagtgct gggattacag gtgtgagcca ccaggcccag actgctgaag ggtttaaacc 1500agagaaagaa tgtgaccaga tttccaattt agaaagaccc gctctctgca gggtaaggag 1560agcctggggg tccgggggcg gggggcaaga attgcaaggt aaccagggag gccagtgcaa 1620tgtccaggtg ggagaggatg ctagctgaga ctagaagtgc taggaaaagg atgtgtgcag 1680acaagaggtc actggggagg tgaaataaca aggcttggcc atgagtggaa cccaacaccc 1740atggtgccct cttgagagag ggaagatggc acctgagatg gaagatggaa agaccagggt 1800ccctgtgact gaggactgag cctctgtttg aggtttttgc agaggagtaa aggcaacaaa 1860agaggcaaga gttggaagaa aggtgacaag gaacaaaagt cagctatgcc tgatgctact 1920gggtggccag caacaatgct gacttggcca aggctctgag agctttacta tgctgggact 1980ggaggtcaga gttgaggcta gggtaagagc aaggggctca gagatggagg gggaggagga 2040cctgaacaag tccagaaggg aagagatttg tccctctatc caacagagta cccagtgagc 2100agcacagagg gcacagcaag ggacatcacc cggttcccca aatgctcaga gccacaagtg 2160aagccaaaag tgaaagacaa gatgcagaaa accgccacgg gcctttgagg aagggtaaag 2220gcgaaagcga aagcaggaag tacagacgtg aagcctagca gaggactttt tagctgctca 2280ctggccccgc ttgtctggcc gactcatccg cccgcgaccc ctaatcccct ctgcctgccc 2340caagatgctg aagccagccc tggagccccg agggggcttc tccttcgaga actgccaaag 2400gtgaagcggg ggcgcggggg gcggtcactc ctgagccgcc tctgcttgct cgtggccttt 2460tttcctggct gggggtgggg gagggtgtgt tggtcgactt gggttccagg cttaccccgg 2520aagatgaggg agacggggac caggttaggg gaagcaacag gggtcttgaa agcagagccg 2580aaacatgggc gccctcctcc gtttccagaa atgcatcatt ggaacgcgtc ctcccggggc 2640tcaaggtccc tcacgcacgc aagaccggga ccaccatcgc gggcctggtg ttccaagtga 2700gcagcgggga gggacgggga gctggagggg agccgagagt atcgagcagg cactgaagct 2760gcggtccctc cctctcctca ggacggggtc attctgggcg ccgatacgcg agccactaac 2820gattcggtcg tggcggacaa gagctgcgag aagatccact tcatcgcccc caaaatctag 2880tgagactccc gagcccagtt cccgtacgca aaaaagaacg gccccctcgt tcccactccg 2940gtccccgcac gtcccagccc tgcccacacc gatcctccct tttgcctcag ctgctgtggg 3000gctggagtag ccgcggacgc cgagatgacc acacggatgg tggcgtccaa gatggagcta 3060cacgcgttat ctacgggccg cgagccccgc gtggccacgg tcactcgcat cctgcgccag 3120acgctcttca ggtgcggggg cagggctaac aggaccccgg caggtagttt acggggttgg 3180ggccattgga aggcgggaca gaaagaaggg cgggaccgcg acgggccagg tgaccggaag 3240aggccggccc aagagaacct gggctacagg aaaaggcgat gtcagtcatc gggcgccagc 3300ccacaggaag gagcggggat agcacctagg agctgggcat agagaggtgg gcctaggccc 3360cagcttgtgg ccgaccccgc ccatcctcga gcaggtacca gggccacgtg ggtgcatcgc 3420tgatcgtggg cggcgtagac ctgactggac cgcagctcta cggtgtgcat ccccatggct 3480cctacagccg tctgcccttc acagccctgg gtgagcgctt ctgtcccttc tcctcgaact 3540ctgcccctgg tgaccttggc ctcactccaa acggcgtcgc agcggttgac ttcagatgct 3600tctcctgcct tcaggctctg gtcaggacgc ggccctggcg gtgctagaag accggttcca 3660gccgaacatg acggtgagcg gcctctgtcc ccgactttgt ggtcgctggt gggatgtgca 3720cccgggagct gggggagcac aggaccctgg cccagtgcgg gtggctaagg cttgtcggag 3780gaggtgacca ctgaagggtg agtggagtaa gggcagagaa gtgcggtccc gacataacac 3840cgtccaatac caaagcctgc acggctggga gaagtcgaag ctcacagagg atctttagga 3900gccgagggcg gagagaagga ccagtagggt cctacttata tcaacgtctg gagcctagat 3960tttgtttggg gtgggatgga agcaggtgat gttgcctcag aggtggctaa ggctcagagg 4020gagaaacaca gtgggggttt ggagggcaag accagattgg gtaagtggac aggcaagtcc 4080ccaggctgta gcctaagtta acagcagaga gagcccgtta ggtctcacac acccatcacc 4140gcagctggag gctgctcagg ggctgctggt ggaagccgtc accgccggga tcttgggtga 4200cctgggctcc gggggcaatg tggacgcatg tgtgatcaca aagactggcg ccaagctgct 4260gcggacactg agctcaccca cagagcccgt gaagaggtga gagctggaga tcggggacca 4320cagggatgtg tggggctata gcaggggaga tagggggctg caaaaagggg atgggccaca 4380tgacaggccc atgttcagag gctgtccctc ctccctccca ggtctggccg ctaccacttt 4440gtgcctggaa ccacagctgt cctgacccag acagtgaagc cactaaccct ggagctagtg 4500gaggaaactg tgcaggctat ggaggtggag taagctgagg cttagagctt ggaacaaggg 4560ggaataaacc cagaaaatac agttaaacag atggctgtgt cattcttgag tggaatgggg 4620tgggcaggca gccagcaggg ctctgtagct aaggcgtccc tgcaggggcc attacctacc 4680atagctctag tgtctggcct aagagatgcc cttcacccat aacctcaggc acctacaact 4740ccagaacccc agccctggcc agcattgcag gcttggtctc cacccaaacc ttccttctga 4800ctccacactt gaaggctccc ccaccactcc actgtcttgc tcttgccctc tagtccactg 4860ggagacttgt aaattatgaa ataccccatg tactaccccc tcctagagac tttccatggc 4920tcctcagtgg cccaggacaa gctcatacct ttcaatcagg cccccacagg ccccactgag 4980ggctaaagtg ctgacaagag gagccgctcc ctgactccaa ggcaagttct caccaagcac 5040tcctcaacct cgcaacatct ttacctgtga caccccttag atgacgaggc atgcctgcac 5100tgctcacgtg aagctcgtct tctgtctgca catgctgggc ttgtgactcc aagttttcca 5160ggctaataag ggtcacagga ctcacatggg gagagatgac acgtttctcc aacaaacctt 5220tgctgggccc ctgctgagtc tcaggcctgg ctgctgggtg ccagcaagag catcctgtcc 5280tcagcgagaa cggctgaact ccgctggagc ttcagaaatg tcagggagag tctacccagg 5340gcccagggag ggtctatgcc gggctgcaca tccccaggct gctgagtgtg ctccctgcac 5400cccaacattc tattaatgaa catttgtaaa tgtaacagaa aagtagaaag agttgtatat 5460tgaataccct tatactgtca ggtcaccaca gacctgacag tattttgtta tatttgtttt 5520atcatctatt catccctcta tccattaatt catcgctcct tttttttttt tttttttttt 5580tttgagacgg cgtctcgctc tgtcacccag gctctggagt gcaaatattt tgttatattt 5640gttttatcat ctattcatcc ctctatccat taattcatcg ctcctttttt tttttttttt 5700tttgagacgg agtctcgctc tgtcacccag gctctggagt gcagtggcgc aatctcagct 5760cactggaagc tccgcctccc aggttcacgc cattctcctg cctcagcctc ccgagtagct 5820gggactacag gtgcccgcca ccacgcgcgg ctaatttttt tttttttttg tatttttagt 5880agagacgagg ttctactgaa cctgttagcc aggatggtct ttgatctcct gacctcatga 5940tccgcccgcg tcggcctccc aaagtgctgg gattacaggc gtgagccacc gtgcccagcc 6000aattcatctc attttttggc tgatgctgtt tctttgagat ggggtctagc tccatcgccc 6060aggccggaat gcagtggtgc actcatggct cactgcagcc ttgaacttaa gggctcaagt 6120gatccctcct gcctcagcct tctgagttgc tgggactaca ggtgtgtacc atcataccca 6180gcacatttct taatttaaaa aaattttttt tgtagagaca gggtttcatg atgttgctca 6240ggctggtctc gaactcctgg aatcaagcct cctacgtctg cctcccaaag ttttgggatt 6300acaggtgtga gccaccacac ccagccctga tctgttcttg aatcagttaa agccctcaca 6360ctcccagaag gccgccagcc aatgcacctg ttggaacttt gcacacaggg tgtcttctcc 6420cttcaagctt ggtctgcagc tcagtaacaa atgggctaca gacaccaggg gcttgcccat 6480gggagcccca aggcctaaag agggtggcag agatttgatg tctgtcactc tccacctgca 6540gcctcagtcc acggtcggcc aggcaccaag agctcacact ttgccctcct aaatgccagg 6600cccttcataa gtatcatctc attgttaaga gcggaggctt cagcgccaga caaatgcgag 6660tttgcgtaca actcaaccac gtgctggtgg gagagtcacc atctctgagc agacctgtga 6720ctcctgttcc aaatggacga ggaaccactg cgatgatgtg ttaggactcc cagcctgcca 6780gaacctcaca gcccctggcc cttcacagca aagttgaccg cagtgagcat tccatccacc 6840agtcagaaca ccctggacgc tgagcggacc ttctctgaaa gcctggtgcc tttgttagcc 6900ctgggtgact cctgtgatcc cagccaccag gttgtcacta tagacctaat ttaaccatct 6960gtcctcagta ccgagggctc aacatttgga atgggaggtg gttctgggag ccaattagag 7020gccaggcttt gggaggtggc agaggtgagt ctcacacctt gggctctgtc tgataagtct 7080aggtctcggt caggggacct tggcctaaag ggcctgtctt gcctggagcg tgggaggggg 7140ctgagtctac acagctggcc tggcctcagg cctggagctt tagctcaagg acgagaagac 7200ccataaagcc agacccagct cccaacctca catctgccac gatgttgctg ctcagcctga 7260ccctaagcct ggttctcctc ggctcctcct ggggtgagtg ggccaggacc agccctgatt 7320cagccctggg agcaactcag ctcccagcaa cagcccaggg aaggagctag gctggctgga 7380agggacgaag gtggacagag tgggtaaaag aaacaggata tgccagggca gtggagcagg 7440gaacagtcct gcagggctgg gagggggcaa gaggtggggt ggtctcacaa ataggaccag 7500agattgagcc aggccctgga gcccgggagg gtttaggaag ctgagacagg aagacctgtc 7560catgtctttt agaaagaacc ttctggctgc atgaagggta tgaactgttc aggtcgggag 7620ggggcagaga gaccaggggt agagatgggg aacagcgggg actaggctgg agacagatgt 7680aggagaacag cagggctggg ggactgggtg gatagggata accaagatag ctgtggggcc 7740cgaaggtgct tgcatgtacc ctgttgggga aggggtagtg ctgtaccctc tcgacagacc 7800tctctggggt gcacagcctg gggcacccaa aaggaggtgg ggaaagatgg gctgaggcat 7860gggaagcagg tcctcattag cccaatggcc aggctgcggc attcctgcca tcaaaccggc 7920actgagcttc agccagagga ttgtcaacgg ggagaatgca gtgttgggct cctggccctg 7980gcaggtgtcc ctgcaggtac accaccagag gggtgggcag ggtcctgggt acgtcatgcc 8040taggggcagc ctcagcagcc catccccact ctgacctctg agccctgacc acaggacagc 8100agcggcttcc acttctgcgg tggttctctc atcagccagt cctgggtggt cactgctgcc 8160cactgcaatg tcaggtgagt gcctgcattc cacctgcccc gcccctcgcc tcttcctgcc 8220tcctcccctg gctgtccccc tctcgcgctg gcctccctgc agctgcctaa tcccaccccc 8280ttgcagccct ggccgccatt ttgttgtcct gggcgagtat gaccgatcat caaacgcaga 8340gcccttgcag gttctgtccg tctctcgggt gagtgcctgg gctgcagaca cggaggaaaa 8400gtgggcagtg caggtgggtg ggtgctggga acgaggaatt caggacatgc cctggcctac 8460cctgctcagc acccatcaga acatggactg tttctgaccc cacaggccat tacacaccct 8520agctggaact ctaccaccat gaacaatgac gtgacgctgc tgaagctcgc ctcgccagcc 8580cagtacacaa cacgcatctc gccagtttgc ctggcatcct caaacgaggc tctgactgaa 8640ggcctcacgt gtgtcaccac cggctggggt cgcctcagtg gcgtgggtag ggactcaggc 8700caaagctcag ggtgggagga ctggggtggg gacagtgttc tgggccccat gtgaccaccc 8760ctcctggcca caggcaatgt gacaccagca catctgcagc aggtggcttt gcccctggtc 8820actgtgaatc agtgccggca gtactggggc tcaagtatca ctgactccat gatctgtgca 8880ggtggcgcag gtgcctcctc gtgccaggta agccccagca cccgctcctc tgcgctgtcc 8940tagtggtata cctccccaac cccccctact caattctccc tccctcttcc ctctcagggt 9000gactccggag gccctcttgt ctgccagaag ggaaacacat gggtgcttat tggtattgtc 9060tcctggggca ccaaaaactg caatgtgcgc gcacctgctg tgtatactcg agttagcaag 9120ttcagcacct ggatcaacca ggtcatagcc tacaactgag ctcaccacag gccctcccca 9180gctcaaccca ttaaagaccc aggccctgtc ccatcatgca ttcatgtctg tcttcctggc 9240tcaggagaaa gaagaggctg ttgagggtcc gactccctac ttggacttct ggcacagaag 9300gggctgagtg actccttgag tagcagtggc tcttcctaga gtagccatgc cgaggccggg 9360gcccccaccc ctcctccagg gcaacccctt ggtcctacag caagaagcca gaactgttgg 9420aatgaatggc agccctccct ggagaggcag cctgtttact gaatacagag gatacgttta 9480caaactgaat acgcataata aataactgca cattctccat ccacaggcca tggcatgaag 9540gcccaagtgg gtctatcaaa ggcccacatc tccaaacccc tgtcctgccc tcaggaccag 9600gcccaccctg ggcaagagag aacgtaagcc ccagggcttc aggtccccag agacacttgg 9660ggaactgggg ggaaattctg aggccatggg gcttggttct ccactgcctc ctgcccaggg 9720ggatttgggg acggtaggag gatgtgtcta aggcatagtc gacttggcac agagtggtct 9780ctttagtttt gtttcccact ggaggtggca catgcaggaa aagggcctgg cccaggctgc 9840cgaccggcag aagctgagtg ggaaccaaac cctcctgcaa ttggcagggc cctgccgtca 9900agctaaggcc aaagctgggc cctgggccca ttctacccac tgaaggcagc tgtggaggaa 9960ggggcttggg ttccagcctg gtttgtggta gggggagata ccacaaaaga aatggggatg 10020gttctggctc aggcctctgg gaaagcagcc acccaacccc acccacctcc cgcaggggct 10080ccttccagct tgaggctcag tgggacccag actggaaggt taatgctgtg aagggaagca 10140gcacagggtg gacggggcaa ggccagctgt gagaaggcag tgcccctggc accctggttt 10200cagaggcagg tcacacagta tggctaagtt ccagggaggg gtgcgcagaa gctcagcaga 10260aggggagagg tgagcagccc gggaccctcc cccagggcgg caactcctac cttcccatgt 10320cctcatggag gactacaggt gtgcaccatg ggtgggtgtg cacgatgggc aggtgtgcac 10380gatgggcgtg cagtgatcac tcccaggctg ccaacaccca tgcagacacc agatggcgcc 10440ttcgtgcagc tgcagaggag ggagcaacag agcctgaagg gaaaaggcaa tggggctgca 10500ccaaaggata gaacccaggc tgacactcga ccctaatcgg gaggaccccc ttccctctgc 10560cttggccccc aggtgcccca ttccccaggt agcagcagtg gggctccctt taaccacccc 10620cagttgggaa ggaggcacct ggggaatgga atggacatca acggggagag ggaggtagcg 10680gtgctctaca aagaaggcac caagggcggt gggctgagac ccctcagaat cttggagagg 10740ctggagcctg ggcaagccga tgaccagcat ggccacacag tccagaaggg tgaaggtcca 10800cgccatggcc ctccaccaga ggtcctggga ccaggaaggc tccctggagg caccatgaag 10860gaagacagat cttggctggg aggtggaggg ctgtttcgac ctagccaggg gctacgggtc 10920cagtcaaggc acaagctttg tgcctaccag ggtctcccac tggagcataa tcttaaggat 10980caggatgcat gggaatgtgt gaaaccaggg agaagggctc tgtggaggaa agggggtccc 11040agaagtaact gtcccaaagg gtcctgaggc cacaggacac tccacccagc actgcagttc 11100cctttgattg gggaaaagtc aaagggcaag ggagacagtg aaggccaggt cctatccctt 11160cccaactcca ccagagcagc tgcccaccaa gaggggtatc agtgccagcc aggctcccag 11220ttcaggggga gtcacagccc cctgtgctac ctctactctg tcacacctgg cccaggccat 11280ggtgaggaca ggggctgctg aaggcacaga gaaagggctg gagccagaca ttcttcacct 11340actgtgggcc acataggcct atctccagag agggcatcgg acccagatgg caccacagtg 11400tgtggccagg ctgggtcgtg ctgcatgtgt gcacagccag gcggctcagc cattgtattg 11460ctgctggtag cgcaggttga gctcccgcag ctcccgttcc cgcacacggc gtgacttatt 11520ggagcgtgtg gagcggctgg aacgcgtgga ctgggcagat ttggtgctct ggcagcgcga 11580ggaggcacgt ttaaggaggt tctgggatat ggagcggtgc aggttcttca tggatgaaga 11640ggcagccatg ctcaccaccc acgggtgcct cagggcctgc agtgcagtca tacgggctcc 11700agggtccact gtcagcaggc ggtcaatgaa gtccttggcc aggttggaca cactaggcca 11760gggctagaga ccaaggacaa gcattagagt gagagcatct gacactgccc accccatctg 11820gatgaggcca ctactcagca accctcccct ttccagagag aggtgctgcc cctcctctca 11880tgtagcactt ggggcctccc cgcccaacgc tggctcaggc tgaacaaggg ctgctctcca 11940ggtgatggag tctggcaagg aaggaaagga cctgtgcact ctcccaggga gcaaattcta 12000tggtgcactg gacccgaagc ctggctccag ggagatggcc tctgccaaga ccccccggaa 12060cgtgtcccag gagtatcata actcagggga ctgttagaga atgattcaaa ctttcccacc 12120acatcctaag tcagattgaa gctccaatct ctggatgacc aggatcaggc tacttaaagg 12180ggaacttcct agtccttaca gagaagatcc aacctctctc caactgccga agcagtggca 12240gaagaccact gctccctgcc tctcctcccg gcatggggag gaaaggaaac aattcaaggc 12300aactagattt cccagtcggc tgagggcagg cgatcccggg ccaggaagga accaggaccc 12360ttctcagtgg caccctctgg cccgcattac ttctctaagc cacaaagggc tcctggcagt 12420gctgtgcgcc agcctcattt tagtacattc tgtcccctgg gaggaactcc ataaagccca 12480ctctgccaca tgcaccccgg gctgcctcat ctcagccccg aacccagcag ctgtctgtct 12540cagggcctca ggttgtacgg ctgtcttcac ctgactggat cctcaggttc tcagggtaaa 12600ggacacttgc tcagactccc tcttagcccc cagtgcttcc agcaattatt ccagctgtaa 12660cgtgagactg caatttcatg ttcgtttagt attcccatga gatcatgctg agctggatga 12720gcccggcctg gtgctgcgca tacaggaagc actcagtagg cacaggctca gacagtaaac 12780aacccacggt gctgccggat gggtgccctt tcctggagct gcttccaggc cttggggctc 12840agccaggtga gtccttgcgt ccctgcatct cctaggaaca cttctggcac gggctctgag 12900gctcccccaa ggataggcag ctaggacctt tcctgagcct gctgcagatg actcaacagg 12960gatgctaacg atcccctcat cttccttcct gccaggtgag gtctgcctgt tccacccatg 13020gtacccttca ccttgaggaa cccctgaaca tgccctccag ggggttcagg aggatctgag 13080agaccacctt cagggcaggt gcacagccat ctagcagaca cacacactca ctgactactg 13140ctactcccag tctggctcgc ctgacctcca actctttccc tacccccttc cccactgcca 13200cagagggatg aggcanngag aacacgcttc caccgtcctg aggaaggcnt ggggctacct 13260gcagctgctg tcttcaccca ctctttggaa ggttattcca agttttactg agctgaagtg 13320ggagcaacag gggaaccata ttcccaaaca cacctaacag ggtcatcctc atcagtgggc 13380cagcagcaca cagtgactcc tggggagatg ctggccccag gaggaggaag tcagggtcca 13440ggagcatgca gccaacgaag gcccatagat gccttactat ccaagggctg tgggtgggcg 13500cagagagcaa cagccctccc cgacaggcag gtaagtctcc tgggggcttg tgtagttcaa 13560gattcatatt gagggccagg cgtggtggct catgcctgta atcccagcac tttggggagg 13620ctgaggcagg tggatcacaa ggtcatgaga tcaagaccat cctggccaac atggtgaaac 13680cccgtctcta ctaaaaatac aaaaattagt cgggcgtggt ggcgtgcctg tagtccagct 13740actcaggaag ctgaggcagg agaattgctt gaacctgaga ggcggaggtt gcagtgagcc 13800aagatcgcac cactgcactc caggctggga aagagggggg ttccgtttcc aaaaaaaaaa 13860aaa 13863 84 3044 DNA Homo sapiens misc_feature (1)..(3044) n = a, c, gor t 84 aggcagggcg ggcgggcgct ctaagggttc tgctctgact ccaggttgggacagcgtctt 60 cgctgctgct ggatagtcgt gttttcgggg atcgaggata ctcaccagaaaccgaaaatg 120 ccgaaaccaa tcaatgtccg agttaccacc atggatgcag agctggagtttgcaatccag 180 ccaaatacaa ctggaaaaca gctttttgat caggtggtaa agactatcggcctccgggaa 240 gtgtggtact ttggcctcca ctatgtggat aataaaggat ttcctacctggctgaagctg 300 gataagaagg tgtctgccca ggaggtcagg aaggagaatc ccctccagttcaagttccgg 360 gccaagttct accctgaaga tgtggctgag gagctcatcc aggacatcacccagaaactt 420 ttcttcctcc aagtgaagga aggaatcctt agcgatgaga tctactgcccccctgagact 480 gccgtgctct tggggtccta cgctgtgcag gccaagtttg gggactacaacaaagaagtg 540 cacaagtctg ggtacctcag ctctgagcgg ctgatccctc aaagagtgatggaccagcac 600 aaacttacca gggaccagtg ggaggaccgg atccaggtgt ggcatgcggaacaccgtggg 660 atgctcaaag ataatgctat gttggaatac ctgaagattg ctcaggacctggaaatgtat 720 ggaatcaact atttcgagat aaaaaacaag aaaggaacag acctttggcttggagttgat 780 gcccttggac tgaatattta tgagaaagat gataagttaa ccccaaagattggctttcct 840 tggagtgaaa tcaggaacat ctctttcaat gacaaaaagt ttgtcattaaacccatcgac 900 aagaaggcac ctgactttgt gttttatgcc ccacgtctga gaatcaacaagcggatcctg 960 cagctctgca tgggcaacca tgagttgtat atgcgccgca ggaagcctgacaccatcgag 1020 gtgcagcaga tgaaggccca ggcccgggag gagaagcatc agaagcagctggagcggcaa 1080 cagctggaaa cagagaagaa aaggagagaa accgtggaga gagagaaagagcagatgatg 1140 cgcgagaagg aggagttgat gctgcggctg caggactatg aggagaagacaaagaaggca 1200 gagagagagc tctcggagca gattcagagg gccctgcagc tggaggaggagaggaagcgg 1260 gcacaggagg aggccgagcg cctagaggct gaccgtatgg ctgcactgcgggctaaggag 1320 gagctggaga gacaggcggt ggatcagata aagagccagg agcagctggctgcggagctt 1380 gcagaataca cagccaagat tgccctcctg gaagaggcgc ggaggcgcaaggaggatgaa 1440 gttgaagagt ggcagcacag ggccaaagaa gcccaggatg acctggtgaagaccaaggag 1500 gagctgcacc tggtgatgac agcacccccg cccccaccac cccccgtgtacgagccggtg 1560 agctaccatg tccaggagag cttgcaggat gagggcgcag agcccacgggctacagcgcg 1620 gagctgtcta gtgagggcat ccgggatgac cgcaatgagg agaagcgcatcactgaggca 1680 gagaagaacg agcgtgtgca gcggcagctc gtgacgctga gcagcgagctgtcccaggcc 1740 cgagatgaga ataagaggac ccacaatgac atcatccaca acgagaacatgaggcaaggc 1800 cgggacaagt acaagacgct gcggcagatc cggcagggca acaccaagcagcgcatcgac 1860 gagttcgagg ccctgtaaca gccaggccag gaccaagggc agaggggtgctcatagcggg 1920 cgctgccagc cccgccacgc ttgtctttag tgctccaagt ctaggaactccctcagatcc 1980 cagttccttt agaaagcagt tacccaacag aaacattctg ggctgggaaccagggaggcg 2040 ccctggtttg ttttccccag ttgtaatagt gccaagcagg cctgattctcgcgattattc 2100 tcgaatcacc tcctgtgttg tgctgggagc aggactgatt gaattacggaaaatgcctgt 2160 aaagtctgag taagaaactt catgctggcc tgtgtgatac aagagtcagcatcattaaag 2220 gaaacgtggc aggacttcca tctgtgccat acttgttctg tattcgaaatgagctcaaat 2280 tgattttttt aatttctatg aaggatccat ctttgtatat ttacatgcttagaggggtga 2340 aaattatttt ggaaattgag tctgaagcac tctcgcacac acagtgattccctcctcccg 2400 tcactccacg cagctggcag agagcacagt gatcaccagc gtgagtggtggaggaggaca 2460 cttggatatt tttttagttc tttttttttt ggcttaacag ttttagaatacattgtactt 2520 atacacctta ttaatgatca gctatatact atttatatac aagtgataatacagatttgt 2580 aacattagtt ttaaaaaggg aaagttttgt tctgtatatt ttgttaccttttacagaata 2640 aaagaattac atatgaaaaa ccctctaaac catggcactt gatgtgatgtggcaggaggg 2700 nagtggtgga gctggacctg cctgctgcag ctgcagtcac gtgtaaacaggattattatt 2760 agtgttttat gcatgtaatg gactatgcac acttttaatt ttgtcagattcacacatgcc 2820 actatgagct ttcagactcc agctgtgaag agactctgtc tgcttgtgtttgtttgcagt 2880 ctctctctgc catggccttg gcaggctgct ggaaggcagc ttgtggaggccgttggttcc 2940 gcccactcat tccttctcgt gcactgcttt ctccttcaca gctaagatgccatgtgcagg 3000 tggattccat gccgcagaca tgaaataaaa gctttgcaaa ggca 3044 851953 DNA Homo sapiens 85 cgctcccacc cgcccgtggc ccgcgcccat ggccgcgcgcgctccacaca actcaccgga 60 gtccgcgccc tgcgccgccg accagttcgc agctccgcgccacggcagcc agtctcacct 120 ggcggcaccg cccgcccacc gccccggcca cagcccctgcgcccacggca gcaatcgagg 180 cgaccgcgac agtggtgggg gacgctgctg agtggaagagagcgcagccc ggccaccgga 240 cctacttact cgccttgctg attgtctatt tttgcgtttacaacttttct aagaactttt 300 gtatacaaag gaacttttta aaaaagacgc ttccaagttatatttaatcc aaagaagaag 360 gatctcggcc aatttggggt tttgggtttt ggcttcgttttttctcttcg ttgactttgg 420 ggttcaggtg ccccagctgc ttcgggctgc cgaggaccttctgggccccc acattaatga 480 ggcagccacc tggcgagtct gacatggctg tcagcgacgcgctgctccca tctttctcca 540 cgttcgcgtc tggcccggcg ggaagggaga agacactgcgtcaagcaggt gccccgaata 600 accgctggcg ggaggagctc tcccacatga agcgacttcccccagtgctt cccggccgcc 660 cctatgacct ggcggcggcg accgtggcca cagacctggagagcggcgga gccggtgcgg 720 cttgcggcgg tagcaacctg gcgcccctac ctcggagagagaccgaggag ttcaacgatc 780 tcctggacct ggactttatt ctctccaatt cgctgacccatcctccggag tcagtggccg 840 ccaccgtgtc ctcgtcagcg tcagcctcct cttcgtcgtcgccgtcgagc agcggccctg 900 ccagcgcgcc ctccacctgc agcttcacct atccgatccgggccgggaac gacccgggcg 960 tggcgccggg cggcacgggc ggaggcctcc tctatggcagggagtccgct ccccctccga 1020 cggctccctt caacctggcg gacatcaacg acgtgagcccctcgggcggc ttcgtggccg 1080 agctcctgcg gccagaattg gacccggtgt acattccgccgcagcagccg cagccgccag 1140 gtggcgggct gatgggcaag ttcgtgctga aggcgtcgctgagcgcccct ggcagcgagt 1200 acggcagccc gtcggtcatc agcgtcagca aaggcagccctgacggcagc cacccggtgg 1260 tggtggcgcc ctacaacggc gggccgccgc gcacgtgccccaagatcaag caggaggcgg 1320 tctcttcgtg cacccacttg ggcgctggac cccctctcagcaatggccac cggccggctg 1380 cacacaactt ccccctgggg cggcagctcc ccagcaggagtaccccgacc ctgggttttg 1440 aggaagtgct gagcagcagg gaatgtcacc ctgccctgccgcttcctccc ggcttccatc 1500 cccacccggg gcccaattac ccatccttcc tgcccgatcagatgcagccg caagtcccgc 1560 cgctccatta ccaagagctc atgccacccg gttcctgcatgccagaggag cccaagccaa 1620 agaggggaag acgatcgtgg ccccggaaaa ggaccgccacccacacttgt gattacgcgg 1680 gctgcggcaa aacctacaca aagagttccc atctcaaggcacacctgcga acccacacag 1740 gtgagaaacc ttaccactgt gactgggacg gctgtggatggaaattcgcc cgctcagatg 1800 aactgaccag gcactaccgt aaacacacgg ggcaccgcccgttccagtgc caaaaatgcg 1860 accgagcatt ttccaggtcg gaccacctcg ccttacacatgaagaggcat ttttaaatcc 1920 cagacagtgg atatgaccca cactgccaga aga 1953 861476 DNA Homo sapiens 86 gccacccacc ctccggaccg cggcagctgc tgacccgccatcgccatggc ccgcgggaaa 60 gccaaggagg agggcagctg gaagaaattc atctggaactcagagaagaa ggagtttctg 120 ggcaggaccg gtggcagttg gtttaagatc cttctattctacgtaatatt ttatggctgc 180 ctggctggca tcttcatcgg aaccatccaa gtgatgctgctcaccatcag tgaatttaag 240 cccacatatc aggaccgagt ggccccgcca ggattaacacagattcctca gatccagaag 300 actgaaattt cctttcgtcc taatgatccc aagagctatgaggcatatgt actgaacata 360 gttaggttcc tggaaaagta caaagattca gcccagagggatgacatgat ttttgaagat 420 tgtggcgatg tgcccagtga accgaaagaa cgaggagactttaatcatga acgaggagag 480 cgaaaggtct gcagattcaa gcttgaatgg ctgggaaattgctctggatt aaatgatgaa 540 acttatggct acaaagaggg caaaccgtgc attattataaagctcaaccg agttctaggc 600 ttcaaaccta agcctcccaa gaatgagtcc ttggagacttacccagtgat gaagtataac 660 ccaaatgtcc ttcccgttca gtgcactggc aagcgagatgaagataagga taaagttgga 720 aatgtggagt attttggact gggcaactcc cctggttttcctctgcagta ttatccgtac 780 tatggcaaac tcctgcagcc caaatacctg cagcccctgctggccgtaca gttcaccaat 840 cttaccatgg acactgaaat tcgcatagag tgtaaggcgtacggtgagaa cattgggtac 900 agtgagaaag accgttttca gggacgtttt gatgtaaaaattaaatttta agtgacacta 960 cagaaaaaca caaaaaggtg atgggttgtg ttatgcttgtattgaatgct gtcttgacat 1020 ctcttgcctt gtcctccggt atgttctaaa gctgtgtctgagatctggat ctgcccatca 1080 ctttggctag tgacagggct aattaatttg ctttatacattttcttttac tttccttttt 1140 tcctttctgg aggcatcaca tgctggtgct gtgtctttatgaatgtttta accattttca 1200 tggtggaaga attttatatt tatgcagttg tacaattttatttttttctg caagaaaaag 1260 tgtaatgtat gaaataaacc aaagtcactt gtttgaaaataaatctttat tttgaacttt 1320 ataaaaagca atgcagtacc ccatagactg gtgttaaatgttgtctacag tgcaaaatcc 1380 atgttctaac atatgtaata attgccagga gtacagtgctcttgttgatc ttgtattcag 1440 tcaggttaaa acaacggtca ataaaagaat gaacac 147687 439 DNA Homo sapiens 87 ggtgggtctg aatctagcac catgacggaa ctagagacagccatgggcat gatcatagac 60 gtcttttccc gatattcggg cagcgagggc agcacgcagaccctgaccaa gggggagctc 120 aaggtgctga tggagaagga gctaccaggc ttcctgcagagtggaaaaga caaggatgcc 180 gtggataaat tgctcaagga cctggacgcc aatggagatgcccaggtgga cttcagtgag 240 ttcatcgtgt tcgtggctgc aatcacgtct gcctgtcacaagtactttga gaaggcagga 300 ctcaaatgat gccctggaga tgtcacagat tcctgcagagccatggtccc aggcttccca 360 aaagtgtttg ttggcaatta ttcccctagg ctgagcctgctcatgtacct ctgattaata 420 aatgcttatg aaaaaaaaa 439 88 5431 DNA Homosapiens 88 ggcagccggg cgccccgcgg ggctctccgc gctgcgttcc cgacccctggggggaggtgt 60 ggagtccaag cggtgcattc ttgaaccatc ttgtcagacg ccggcggctcgcgggctgtg 120 gcgggggctg cggtcaaggc cgcgctcctg ggggccgccg cctgggagggtgggcgccca 180 ggcgtccctg cagccccggg tgctccgact gcgcggcggg gccgcggcgcgcgcgcccgg 240 gcgtccgggc gtccgggaca gtggtgccag acactcccaa atcccgagccggcccagcct 300 cgtacggagg accttttttt tggttctgtt ggtgacccgt tagccgccgctggggcctaa 360 caccaagttg agggctcgcg gattagccgc ccgccagccg tggaaatgtgataagagcgg 420 taccgtttgc agaaggaaat ttctgatgca actcttcgcc tttgctgattgcctctccaa 480 acgcctgcct gacgactgcc ttggagcatg tgcgttatgg aaattaggctttggcgctga 540 ccacaatgct gagcaggaag cagcagctgc aggcccagtg actggtagctcagtgaccag 600 cagcccagtg accggcagcc aggtcctcac ctgggtcctc tcagtgaagccagggtggcc 660 gccccagcag acagtgctac agagccaact cctgacaggt tctgaaaatattgtgcacag 720 ggcaggctga ggacacagcc acgtgatacc cactgtagag agagggagagagagacctcc 780 tatgcaagct gccggccctc tgttccgtag taaggacaag gtggagcagacacctcgcag 840 tcaacaagac ccggcaggac caggactccc cgcacagtct gaccgacttgcgaatcacca 900 ggaggatgat gtggacctgg aagccctggt gaacgatatg aatgcatccctggagagcct 960 gtactcggcc tgcagcatgc agtcagacac ggtgcccctc ctgcagaatggccagcatgc 1020 ccgcagccag cctcgggctt caggccctcc tcggtccatc cagccacaggtgtccccgag 1080 gcagagggtg cagcgctccc agcctgtgca catcctcgct gtcaggcgccttcaggagga 1140 agaccagcag tttagaacct catctctgcc ggccatcccc aatccttttcctgaactctg 1200 tggccctggg agccccgctg tgctcacgcc gggttcttta cctccgagccaggccgccgc 1260 aaagcaggat gttaaagtct ttagtgaaga tgggacaagc aaagtggtggagattctagc 1320 agacatgaca gccagagacc tgtgccaatt gctggtttac aaaagtcactgtgtggatga 1380 caacagctgg acactagtgg agcaccaccc gcacctagga ttagagaggtgcttggaaga 1440 ccatgagctg gtggtccagg tggagagtac catggccagt gagagtaaatttctattcag 1500 gaagaattac gcaaaatacg agttctttaa aaatcccatg aatttcttcccagaacagat 1560 ggttacttgg tgccagcagt caaatggcag tcaaacccag cttttgcagaattttctgaa 1620 ctccagtagt tgtcctgaaa ttcaagggtt tttgcatgtg aaagagctgggaaagaaatc 1680 atggaaaaag ctgtatgtgt gtttgcggag atctggcctt tattgctccaccaagggaac 1740 ttcaaaggaa cccagacacc tgcagctgct ggccgacctg gaggacagcaacatcttctc 1800 cctgatcgct ggcaggaagc agtacaacgc ccctacagac cacgggctctgcataaagcc 1860 aaacaaagtc aggaatgaaa ctaaagagct gaggttgctc tgtgcagaggacgagcaaac 1920 caggacgtgc tggatgacag cgttcagact cctcaagtat ggaatgctcctttaccagaa 1980 ttaccgaatc cctcagcaga ggaaggcctt gctgtccccg ttctcgacgccagtgcgcag 2040 tgtctccgag aactccctcg tggcaatgga tttttctggg caaacaggacgcgtgataga 2100 gaatccggcg gaggcccaga gcgcagccct ggaggagggc cacgcctggaggaagcgaag 2160 cacacggatg aacatcctag gtagccaaag tcccctccac ccttctaccctaagtacagt 2220 gattcacagg acacagcact ggtttcacgg gaggatctcc agggaggaatcccacaggat 2280 cattaaacag caagggctcg tggatgggct ttttctcctc cgtgacagccagagtaatcc 2340 aaaggcattt gtactcacac tgtgtcatca ccagaaaatt aaaaatttccagatcttacc 2400 ttgcgaggac gacgggcaga cgttcttcag cctagatgac gggaacaccaaattctctga 2460 cctgatccag ctggttgact tttaccagct gaacaaagga gtcctgccttgcaaactcaa 2520 gcaccactgc atccgagtgg ccttatgacc gcagatgtcc tctcggctgaagactggagg 2580 aagtgaacac tggagtgaag aagcggtctg tgcgttggtg aagaacacacatcgattctg 2640 cacctgggga cccagagcga gatgggtttg ttcggtgcca gccgaccaagattgactagt 2700 ttgttggact taaacgacga tttgctgctg tgaacccagc agggtcgcctccctctgcgt 2760 cagccaaatt ggggagggca tggaagatcc agcggaaagt tgaaaataaactggaatgat 2820 catcttggct tgggccgctt aggaacaaga accggagaga agtgattggaaatgaactct 2880 tgccctggaa taatcttgac aattaaaact gatatgttta ctttttttgtattgatcact 2940 tttttgcact ccttctttgt tttcaatatt gtattcagcc tattgtaggagggggatgtg 3000 gcgtttcaac tcatataata cagaaagagt tttgaatggg cagatttcaaactgaatatg 3060 ggtccccaaa tgttcccaga gggtcctcca caccctctgc cgactaccacggtgtggatt 3120 cagctcccaa atgacaaacc cagcccttcc cagtatactt gaaaagctttcttgttaaaa 3180 taaaaggtgt cactgtggta ggcatttggc atattttgtg gactcagtcaagcaaccaca 3240 gtctgttaat catttctcta tgctcagatg tcagatcctc ttgttattagtgtgtcttgt 3300 tctgcacagt gcaggagact ttattccttt ggaaaattca ctgttccacaaacagcaggc 3360 tgaatggcct cgcctctaga ttgacgtggg ccagcctcct tgagacacacctggcacccg 3420 tcatcggcca gcggtggatg ctgcataatc cacctgggta cttcagccttgcgtttccac 3480 agccttcagc ctgttctaga acgatcactg ccttacccct gctgctgcagtggtgtgagt 3540 cgtttcacgg ctgatgtccc tcgggggatt aaaggatcta aagagaaaatggcacctggt 3600 tgtcttcgtg ctgtgtctca tgggtttcca tagtgataaa gacaaggaaacgctgcaggg 3660 gccacaggca caggctgata tttaaagatc tttgcttgca gccctccgtcctgctgaaaa 3720 cccccataag ccagtgaaca cagagcagct agaggctcct cctctgctggcttagggtca 3780 gaagtacctc acagtggttg tggacatgga agagttttgt caacacaacactttgtcccc 3840 gctccgggag atgagtcaga tggtggcttg agttgtcact tggtcccctccgcccctcgg 3900 gtggccccct ttgccacgtc cccttagctt agtgatcagg tgtgagagtggccatttcct 3960 tacctttgat ccctgtaaag cagaaaggac tcctttgaca ggcgacaaactactgtggtg 4020 agcagaatga tttccttttt caagacaaca cctgcctggc ttctattaatgtgtgctggc 4080 catgatattg ccccaaatcc gccccactga agtgttccct aaggaacagcatttctctgc 4140 tcctcagtca acccccgtag cctagagcag tgtcacaagc ttcagtaaggccagtcagct 4200 ggaagtcagt ctaccgtata gtaacactgt atttcagtct acagaccacactctagttgt 4260 tttccatgaa aggtatacaa atgaagaatt ttctagcaaa acatgtttttaaccatcagt 4320 gctcaattgc attttcttcc tttcgcagcc agtcagtctt tcaaactattgacagtaaga 4380 taattctcac gttcacacct ggtggcaggc ttcactgtag ggacggacattgcagttaca 4440 ccacgattcc ttcctcttca ctggctcgag gtaaaccctt ttcaaggaaaaacaactcta 4500 ggatttcttt tttctgtgta cgtagaccag tcccatcagt gtataatctctctctcacac 4560 gcctctctcc aatagacagc ttgtatttgc agtatttcat atttataaatatgcgtttat 4620 ttaaaaggag aacaaaagct tgactctgat tcacagtttt gtatgtagctggtttgacgt 4680 agtcttttgt attttccctg ccgaagtgaa ttgttggaga atgtaaaccgcctccacgtg 4740 gcggcagact tcctaaggcc ccagctcgct ggcctcgcgc tgggcggctgggaattccac 4800 ctgagaacaa gtcccgcaaa ccggggacgg aaggacattt gacttttatttttgtattta 4860 attgacatga atgtaaaggg gacagctcag ggttgttttg gagcctgttgactttgtatc 4920 tctgcctgtg attttctttt ctaaatgaaa ctccatgtag caaccaggacgaagttgaga 4980 aggaaaacgc caaatgcttt ggttattaga gtttaatagg taagctctgttacactaggt 5040 gttagagttc cagaatgttc ttttgtttgc taaaccttga agaaacatgtgcctcagcct 5100 agatgttttg tcttctcttt tctgcactta atacctgaca gtatgaccgatctctgcgcc 5160 tttctggggg cgggcaagct ggcggtagat ttgtgatgtc acagtgcaaactgcagtgac 5220 tgtaaattgg cctggcgtgt ataaacgttt tcagggaatg cagaaggtattaatgaagag 5280 acaaaacctt tattccatgt gctttgcttc attctgtaca tagctctttggctcgtgaac 5340 ctaattgtaa actttcaggt atttttgtac aaataaggga ctgatgttctgtttcttgta 5400 attagaaata aacattaata cagtgttctt c 5431 89 1223 DNA Homosapiens 89 acactcgctc ggctcaccat gtgtcactct cgcagctgcc acccgaccatgaccatcctg 60 caggccccga ccccggcccc ctccaccatc ccgggacccc ggcggggctccggtcctgag 120 atcttcacct tcgaccctct cccggagccc gcagcggccc ctgccgggcgccccagcggc 180 tctcgcgggc accgaaagcg cagccgcagg gttctctacc ctcgagtggtccggcgccag 240 ctgccagtcg aggaaccgaa cccagccaaa aggcttctct ttctgctgctcaccatcgtc 300 ttctgccaga tcctgatggc tgaagagggt gtgcgggcgc ccctgcctccagaggacgcc 360 cctaacgccg catccctggc gcccacccct gtgtcccccg tcctcgagccctttaatctg 420 acttcggagc cctcggacta cgctctggac ctcagcactt tcctccagcaacacccggcc 480 gccttctaac tgtgactccc cgcactcccc aaaaagaatc cgaaaaaccacaaagaaaca 540 ccaggcgtac ctggtgcgcg agagcgtatc cccaactggg acttccgaggcaacttgaac 600 tcagaacact acagcggaga cgccacccgg tgcttgaggc gggaccgaggcgcacagaga 660 ccgaggcgca tagagaccga gcacagccca gctgggctag gcccggtgggaaggagagcg 720 tcgttaattt atttcttatt gctcctaatt aatatttata tgtatttatgtacgtcctcc 780 taggtgatga gatgtgtacg taatatttat tttaacttat gcaagggtgtgagatgttcc 840 ccctgctgta aatgcaggtc tcttggtatt tattgagctt tgtgggactggtggaagcag 900 gacacctgga actgcggcaa agtaggagaa gaaatgggga ggactcgggtgggggaggac 960 gtcccggctg ggatgaagtc tggtggtggg tcgtaagttt aggaggtgactgcatcctcc 1020 agcattctca actccgtctg tctactgtgt gagacttcgg cggaccattaggaatgagat 1080 ccgtgagatc cttccatctt cttgaagtcg cctttagggt ggctgcgaggtagagggttg 1140 ggggttggtg ggctgtcacg gagcgactgt cgagatcgcc tagtatgttctgtgaacaca 1200 aataaaattg atttactgtc tgc 1223 90 3536 DNA Homo sapiens90 ggcccctcga gcctcgaacc ggaacctcca aatccgagac gctctgctta tgaggacctc 60gaaatatgcc ggccagtgaa aaaatcttat ggctttgagg gcttttggtt ggccaggggc 120agtaaaaatc tcggagagct gacaccaagt cctcccctgc cacgtagcag tggtaaagtc 180cgaagctcaa attccgagaa ttgagctctg ttgattctta gaactggggt tcttagaagt 240ggtgatgcaa gaagtttcta ggaaaggccg gacaccaggt tttgagcaaa attttggact 300gtgaagcaag gcattggtga agacaaaatg gcctcgccgg ctgacagctg tatccagttc 360acccgccatg ccagtgatgt tcttctcaac cttaatcgtc tccggagtcg agacatcttg 420actgatgttg tcattgttgt gagccgtgag cagtttagag cccataaaac ggtcctcatg 480gcctgcagtg gcctgttcta tagcatcttt acagaccagt tgaaatgcaa ccttagtgtg 540atcaatctag atcctgagat caaccctgag ggattctgca tcctcctgga cttcatgtac 600acatctcggc tcaatttgcg ggagggcaac atcatggctg tgatggccac ggctatgtac 660ctgcagatgg agcatgttgt ggacacttgc cggaagttta ttaaggccag tgaagcagag 720atggtttctg ccatcaagcc tcctcgtgaa gagttcctca acagccggat gctgatgccc 780caagacatca tggcctatcg gggtcgtgag gtggtggaga acaacctgcc actgaggagc 840gcccctgggt gtgagagcag agcctttgcc cccagcctgt acagtggcct gtccacaccg 900ccagcctctt attccatgta cagccacctc cctgtcagca gcctcctctt ctccgatgag 960gagtttcggg atgtccggat gcctgtggcc aaccccttcc ccaaggagcg ggcactccca 1020tgtgatagtg ccaggccagt ccctggtgag tacagccggc cgactttgga ggtgtccccc 1080aatgtgtgcc acagcaatat ctattcaccc aaggaaacaa tcccagaaga ggcacgaagt 1140gatatgcact acagtgtggc tgagggcctc aaacctgctg ccccctcagc ccgaaatgcc 1200ccctacttcc cttgtgacaa ggccagcaaa gaagaagaga gaccctcctc ggaagatgag 1260attgccctgc atttcgagcc ccccaatgca cccctgaacc ggaagggtct ggttagtcca 1320cagagccccc agaaatctga ctgccagccc aactcgccca cagaggcctg cagcagtaag 1380aatgcctgca tcctccaggc ttctggctcc cctccagcca agagccccac tgaccccaaa 1440gcctgcaact ggaagaaata caagttcatc gtgctcaaca gcctcaacca gaatgccaaa 1500ccaggggggc ctgagcaggc tgagctgggc cgcctttccc cacgagccta cacggcccca 1560cctgcctgcc agccacccat ggagcctgag aaccttgacc tccagtcccc aaccaagctg 1620agtgccagcg gggaggactc caccatccca caagccagcc ggctcaataa catcgttaac 1680aggtccatga cgggctctcc ccgcagcagc agcgagagcc actcaccact ctacatgcac 1740cccccgaagt gcacgtcctg cggctctcag tccccacagc atgcagagat gtgcctccac 1800accgctggcc ccacgttcgc tgaggagatg ggagagaccc agtctgagta ctcagattct 1860agctgtgaga acggggcctt cttctgcaat gagtgtgact gccgcttctc tgaggaggcc 1920tcactcaaga ggcacacgct gcagacccac agtgacaaac cctacaagtg tgaccgctgc 1980caggcctcct tccgctacaa gggcaacctc gccagccaca agaccgtcca taccggtgag 2040aaaccctatc gttgcaacat ctgtggggcc cagttcaacc ggccagccaa cctgaaaacc 2100cacactcgaa ttcactctgg agagaagccc tacaaatgcg aaacctgcgg agccagattt 2160gtacaggtgg cccacctccg tgcccatgtg cttatccaca ctggtgagaa gccctatccc 2220tgtgaaatct gtggcacccg tttccggcac cttcagactc tgaagagcca cctgcgaatc 2280cacacaggag agaaacctta ccattgtgag aagtgtaacc tgcatttccg tcacaaaagc 2340cagctgcgac ttcacttgcg ccagaagcat ggcgccatca ccaacaccaa ggtgcaatac 2400cgcgtgtcag ccactgacct gcctccggag ctccccaaag cctgctgaag catggagtgt 2460tgatgctttc gtctccagcc ccttctcaga atctacccaa aggatactgt aacactttac 2520aatgttcatc ccatgatgta gtgcctcttt catccactag tgcaaatcat agctgggggt 2580tgggggtggt gggggtcggg gcctggggga ctgggagccg cagcagctcc ccctccccca 2640ctgccataaa acattaagaa aatcatattg cttcttctcc tatgtgtaag gtgaaccatg 2700tcagcaaaaa gcaaaatcat tttatatgtc aaagcagggg agtatgcaaa agttctgact 2760tgactttagt ctgcaaaatg aggaatgtat atgttttgtg ggaacagatg tttcttttgt 2820atgtaaatgt gcattctttt aaaagacaag acttcagtat gttgtcaaag agagggcttt 2880aattttttta accaaaggtg aaggaatata tggcagagtt gtaaatatat aaatatatat 2940atatataaaa taaatatata taaacctaac aaagatatat taaaaatata aaactgcgtt 3000aaaggctcga ttttgtatct gcaggcagac acggatctga gaatctttat tgagaaagag 3060cacttaagag aatattttaa gtattgcatc tgtataagta agaaaatatt ttgtctaaaa 3120tgcctcagtg tatttgtatt tttttgcaag tgaaggttta caatttacaa agtgtgtatt 3180aaaaaaaacc caaagaaccc aaaaatctgc agaaggaaaa atgtgtaatt ttgttctagt 3240tttcagtttg tatatacccg tacaacgtgt cctcacggtg ccttttttca cggaagtttt 3300caatgatggg cgagcgtgca ccatcccttt ttgaagtgta ggcagacaca gggacttgaa 3360gttgttacta actaaactct ctttgggaat gtttgtctca tcccattctg cgtcatgctt 3420gtgtgataac tactccggag acagggtttg gctgtgtcta aactgcatta ccgcgttgta 3480aaaaatagct gtaccaatat aagaataaaa tgttggaaag tcgcaaaaaa aaaaaa 3536 918930 DNA Homo sapiens 91 gaattccgga aagaaagaac atcgtttcag gaataaaaatgcacagtagt agttatagtt 60 accgtagcag tgattctgtg tttagtaaca ctaccagcactcgaaccagt cttgattcaa 120 atgaaaatct tctcttggtt cattgtggtc caacactgatcaactcttgc attagcttcg 180 gcagtgaatc ctttgatgga cacaggttag aaatgttgcaacagattgcc aacagagttc 240 agagggacag tgtcatctgt gaagacaaac tgattcttgctggaaatgct cttcagtctg 300 attctaaaag attagaatca ggagtgcagt ttcagaatgaagcagaaatt gctgggtata 360 tacttgaatg tgagaacctt ttacgccagc atgtaattgatgtacagatt cttattgatg 420 gaaaatacta ccaggcagat caattggtac agagggttgcaaaactgcgt gacgaaatta 480 tggccttaag gaacgaatgt tcttctgtgt acagcaaaggacgcatactg acaacagaac 540 agacaaagct catgatatca ggaatcactc aaagtttaaactcaggattt gcacagacct 600 tacaccctag tctgacctca gggctgaccc agagtttaacaccttcccta acctcttcta 660 gtatgacttc tggcctgtca tcagggatga cttcccgcctgactccatct gtcactccag 720 cttatacacc tggtttccca tcaggattag ttccaaatttcagttcagga gtagagccaa 780 attcattgca aactttgaag ttgatgcaga tccgaaaaccccttctaaag tcttctttgc 840 tggatcaaaa tttaacagaa gaagaaatca atatgaaatttgttcaggat cttttgaatt 900 gggttgatga gatgcaggta caactggacc gcactgagtggggctcagat ttgccaagtg 960 ttgaaagcca tttagaaaat cataaaaatg ttcatagagctattgaagaa tttgaatcta 1020 gtctcaaaga agctaaaatc agtgagattc aaatgacagcacctcttaaa ctgacttatg 1080 cagaaaagtt gcacagatta gagagtcagt atgcaaaactcttgaataca tccaggaatc 1140 aagaacggca ccttgataca ctccataatt ttgtaagtcgtgcgactaat gaacttattt 1200 ggttgaatga aaaagaagag gaggaagttg cttatgactggagtgagaga aacaccaaca 1260 tagctaggaa aaaagattat catgctgaat taatgagagaacttgatcaa aaggaagaaa 1320 atattaaatc agttcaggag atagcagagc agctacttctagaaaatcat ccagcccggt 1380 taactattga ggcctacaga gcggcaatgc agacgcagtggagctggatc ttacagctct 1440 gccagtgtgt ggagcagcac ataaaggaga acacagcgtatttcgagttt ttcaatgatg 1500 ccaaagaagc tactgattac ttaaggaatc taaaagatgccattcagcgg aagtacagct 1560 gtgatagatc aagcagcatt cacaagctag aagaccttgttcaggaatca atggaagaga 1620 aagaagaact tctgcagtac aaaagcacta tagcaaacctaatgggaaaa gcaaaaacaa 1680 taattcaact gaagccaagg aattctgact gtccactcaaaacttctatt ccgatcaaag 1740 ctatctgtga ctacagacaa attgagataa ccatttacaaagacgatgaa tgtgttttgg 1800 caaataactc tcatcgtgct aaatggaagg tcattagtcctactgggaat gaggctatgg 1860 tcccatctgt gtgcttcacc gttcctccac caaacaaagaagcggtggac cttgccaaca 1920 gaattgagca acagtatcag aatgtcctga ctctttggcatgagtctcac ataaacatga 1980 agagtgtagt atcctggcat tatctcatca atgaaattgatagaattcga gctagcaatg 2040 tggcttcaat aaagacaatg ctacctggtg aacatcagcaagttctaagt aatctacaat 2100 ctcgttttga agattttctg gaagatagcc aggaatcccaagtcttttca ggctcagata 2160 taacacaact ggaaaaggag gttaatgtat gtaagcagtattatcaagaa cttcttaaat 2220 ctgcagaaag agaggagcaa gaggaatcag tttataatctctacatctct gaagttcgaa 2280 acattagact tcggttagag aactgtgaag atcggctgattagacagatt cgaactcccc 2340 tggaaagaga tgatttgcat gaaagtgtgt tcagaatcacagaacaggag aaactaaaga 2400 aagagctgga acgacttaaa gatgatttgg gaacaatcacaaataagtgt gaggagtttt 2460 tcagtcaagc agcagcctct tcatcagtcc ctaccctacgatcagagctt aatgtggtcc 2520 ttcagaacat gaaccaagtc tattctatgt cttccacttacatagataag ttgaaaactg 2580 ttaacttggg gttaaaaaac actcaagctg cagaagccctcgtaaaactc tatgaaacta 2640 aactgtgtga agaagaagca gttatagctg acaagaataatattgagaat ctaataagta 2700 ctttaaagca atggagatct gaagtagatg aaaagagacaggtattccat gccttagagg 2760 atgagttgca gaaagctaaa gccatcagtg atgaaatgtttaaaacgtat aaagaacggg 2820 accttgattt tgactggcac aaagaaaaag cagatcaattagttgaaagg tggcaaaatg 2880 ttcatgtgca gattgacaac aggttacggg acttagagggcattggcaaa tcactgaagt 2940 actacagaga cacttaccat cctttagatg attggatccagcaggttgaa actactcaga 3000 gaaagattca ggaaaatcag cctgaaaata gtaaaaccctagccacacag ttgaatcaac 3060 agaagatgct ggtgtccgaa atagaaatga aacagagcaaaatggacgag tgtcaaaaat 3120 atgcagaaca gtactcagct acagtgaagg actatgaattacaaacaatg acctaccggg 3180 ccatggtaga ttcacaacaa aaatctccag tgaaacgccgaagaatgcag agttcagcag 3240 atctcattat tcaagagttc atggacctaa ggactcgatatactgccctg gtcactctca 3300 tgacacaata tattaaattt gctggtgatt cattgaagaggctggaagag gaggagatta 3360 aaaggtgtaa ggagacttct gaacatgggg catattcagatctgcttcag cgtcagaagg 3420 caacagtgct tgagaatagc aaacttacag gaaagataagtgagttggaa agaatggtag 3480 ctgaactaaa gaaacaaaag tcccgagtag aggaagaacttccgaaggtc agggaggctg 3540 cagaaaatga attgagaaag cagcagagaa atgtagaagatatctctctg cagaagataa 3600 gggctgaaag tgaagccaag cagtaccgca gggaacttgaaaccattgtg agagagaagg 3660 aagccgctga aagagaactg gagcgggtga ggcagctcaccatagaggcc gaggctaaaa 3720 gagctgccgt ggaagagaac ctcctgaatt ttcgcaatcagttggaggaa aacaccttta 3780 ccagacgaac actggaagat catcttaaaa gaaaagatttaagtctcaat gatttggagc 3840 aacaaaaaaa taaattaatg gaagaattaa gaagaaagagagacaatgag gaagaactct 3900 tgaagctgat aaagcagatg gaaaaagacc ttgcatttcagaaacaggta gcagagaaac 3960 agttgaaaga aaagcagaaa attgaattgg aagcaagaagaaaaataact gaaattcagt 4020 atacatgtag agaaaatgca ttgccagtgt gtccgatcacacaggctaca tcatgcaggg 4080 cagtaacggg tctccagcaa gaacatgaca agcagaaagcagaagaactc aaacagcagg 4140 tagatgaact aacagctgcc aatagaaagg ctgaacaagacatgagagag ctgacatatg 4200 aacttaatgc cctccagctt gaaaaaacgt catctgaggaaaaggctcgt ttgctaaaag 4260 ataaactaga tgaaacaaat aatacactca gatgccttaagttggagctg gaaaggaagg 4320 atcaggcgga gaaagggtat tctcaacaac tcagagagcttggtaggcaa ttgaatcaaa 4380 ccacaggtaa agctgaagaa gccatgcaag aagctagtgatctcaagaaa ataaagcgca 4440 attatcagtt agaattagaa tctcttaatc atgaaaaagggaaactacaa agagaagtag 4500 acagaatcac aagggcacat gctgtagctg agaagaatattcagcattta aattcacaaa 4560 ttcattcttt tcgagatgag aaagaattag aaagactacaaatctgccag agaaaatcag 4620 atcatctaaa agaacaattt gagaaaagcc atgagcagttgcttcaaaat atcaaagctg 4680 aaaaagaaaa taatgataaa atccaaaggc tcaatgaagaattggagaaa agtaatgagt 4740 gtgcagagat gctaaaacaa aaagtagagg agcttactaggcagaataat gaaaccaaat 4800 taatgatgca gagaattcag gcagaatcag agaatatagttttagagaaa caaactatcc 4860 agcaaagatg tgaagcactg aaaattcagg cagatggttttaaagatcag ctacgcagca 4920 caaatgaaca cttgcataaa cagacaaaaa cagagcaggattttcaaaca aaaattaaat 4980 gcctagaaga agacctggcg aaaagtcaaa atttggtaagtgaatttaag caaaagtgtg 5040 accaacagaa cattatcatc cagaatacca agaaagaagttagaaatctg aatgcggaac 5100 tgaatgcttc caaagaagag aagcgacgcg gggagcagaaagttcagcta caacaagctc 5160 aggtgcaaga gttaaataac aggttgaaaa aagtacaagacgaattacac ttaaagacca 5220 tagaggagca gatgacccac agaaagatgg ttctgtttcaggaagaatct ggtaaattca 5280 aacaatcagc agaggagttt cggaagaaga tggaaaaattaatggagtcc aaagtcatca 5340 ctgaaaatga tatttcaggc attaggcttg actttgtgtctcttcaacaa gaaaactcta 5400 gagcccaaga aaatgctaag ctttgtgaaa caaacattaaagaacttgaa agacagcttc 5460 aacagtatcg tgaacaaatg cagcaagggc agcacatggaagcaaatcat taccaaaaat 5520 gtcagaaact tgaggatgag ctgatagccc agaagcgtgaggttgaaaac ctgaagcaaa 5580 aaatggacca acagatcaaa gagcatgaac atcaattagttttgctccag tgtgaaattc 5640 aaaaaaagag cacagccaaa gactgtacct tcaaaccagattttgagatg acagtgaagg 5700 agtgccagca ctctggagag ctgtcctcta gaaacactggacaccttcac ccaacaccca 5760 gatcccctct gttgagatgg actcaagaac cacagccattggaagagaag tggcagcatc 5820 gggttgttga acagataccc aaagaagtcc aattccagccaccaggggct ccactcgaga 5880 aagagaaaag ccagcagtgt tactctgagt acttttctcagacaagcacc gagttacaga 5940 taacttttga tgagacaaac cccattacaa gactgtctgaaattgagaag ataagagacc 6000 aagccctgaa caattctaga ccacctgtta ggtatcaagataacgcatgt gaaatggaac 6060 tggtgaaggt tttgacaccc ttagagatag ctaagaacaagcagtatgat atgcatacag 6120 aagtcacaac attaaaacaa gaaaagaacc cagttcccagtgctgaagaa tggatgcttg 6180 aagggtgcag agcatctggt ggactcaaga aaggggatttccttaagaag ggcttagaac 6240 cagagacctt ccagaacttt gatggtgatc atgcatgttcagtcagggat gatgaattta 6300 aattccaagg gcttaggcac actgtgactg ccaggcagttggtggaagct aagcttctgg 6360 acatgagaac aattgagcag ctgcgactcg gtcttaagactgttgaagaa gttcagaaaa 6420 ctcttaacaa gtttctgacg aaagccacct caattgcagggctttaccta gaatctacaa 6480 aagaaaagat ttcatttgcc tcagcggccg agagaatcataatagacaaa atggtggctt 6540 tggcattttt agaagctcag gctgcaacag gttttataattgatcccatt tcaggtcaga 6600 catattctgt tgaagatgca gttcttaaag gagttgttgaccccgaattc agaattaggc 6660 ttcttgaggc agagaaggca gctgtgggat attcttattcttctaagaca ttgtcagtgt 6720 ttcaagctat ggaaaataga atgcttgaca gacaaaaaggtaaacatatc ttggaagccc 6780 agattgccag tgggggtgtc attgaccctg tgagaggcattcgtgttcct ccagaaattg 6840 ctctgcagca ggggttgttg aataatgcca tcttacagtttttacatgag ccatccagca 6900 acacaagagt tttccctaat cccaataaca agcaagctctgtattactca gaattactgc 6960 gaatgtgtgt atttgatgta gagtcccaat gctttctgtttccatttggg gagaggaaca 7020 tttccaatct caatgtcaag aaaacacata gaatttctgtagtagatact aaaacaggat 7080 cagaattgac cgtgtatgag gctttccaga gaaacctgattgagaaaact atatatcttg 7140 aactttcagg gcagcaatat cagtggaagg aagctatgttttttgaatcc tatgggcatt 7200 cttctcatat gctgactgat actaaaacag gattacacttcaatattaat gaggctatag 7260 agcagggaac aattgacaaa gccttggtca aaaagtatcaggaaggcctc atcacactta 7320 cagaacttgc tgattctttg ctgagccggt tagtccccaagaaagatttg cacagtcctg 7380 ttgcagggta ttggctgact gctagtgggg aaaggatctctgtactaaaa gcctcccgta 7440 gaaatttggt tgatcggatt actgccctcc gatgccttgaagcccaagtc agtacagggg 7500 gcataattga tcctcttact gtcaaaaagt accgggtggccgaagctttg catagaggcc 7560 tggttgatga ggggtttgcc cagcagctgc gacagtgtgaattagtaatc acagggattg 7620 gccatcccat cactaacaaa atgatgtcag tggtggaagctgtgaaggca aatattataa 7680 ataaggaaat gggaatccga tgtttggaat ttcagtacttgacaggaggg ttgatagagc 7740 cacaggttca ctctcggtta tcaatagaag aggctctccaagtaggtatt atagatgtcc 7800 tcattgccac aaaactcaaa gatcaaaagt catatgtcagaaatataata tgccctcaga 7860 caaaaagaaa gttgacatat aaagaagcct tagaaaaacctgattttgat ttccacacag 7920 gacttaaact gttagaagta tctgagcccc tgatgacaggaatttctagc ctctactatt 7980 cttcctaatg ggacatgttt aaataactgt gcaaggggtgatgcaggctg gttcatgcca 8040 ctttttcaga gtatgatgat atcggctaca tatgcagtctgtgaattatg taacatactc 8100 tatttcttga gggctgcaaa ttgctaagtg ctcaaaatagagtaagtttt aaattgaaaa 8160 ttacataaga tttaatgccc ttcaaatggt ttcatttagccttgagaatg gttttttgaa 8220 acttggccac actaaaatgt tttttttttt acgtagaatgtgggataaac ttgatgaact 8280 ccaagttcac agtgtcattt cttcagaact ccccttcattgaatagtgat catttattaa 8340 atgataaatt gcactcgctg aaagagcacg tcatgaagcaccatggaatc aaagagaaag 8400 atataaattc gttcccacag ccttcaagct gcagtgttttagattgcttc aaaaaatgaa 8460 aaagttttgc ctttttctgt atatagtgac cttctttgcatattaaaatg tttaccacaa 8520 tgtcccattt ctagttaagt cttcgcactt gaaagctaacattatgaata ttatgtgttg 8580 gaggagggga aggattttct tcattctgtg tattttccttacatgtacag tagacgttct 8640 ctattctatc agccttctat ggtacctttt tgtcaggacaattaggattg taatgctaat 8700 gcaaaggcag caattcaaag atcttctagt gcctcatgaataaagttgag atttaaaatt 8760 tgtaacattg atggaacagc tgggaggtta gaccaatcattaaggaatgt atgccatacc 8820 tttctttgct accataaaca ttttggaggt gcatctgctatgtgacatgg taaatatggt 8880 taagtgaatg aataaaatgt tttagtaacc tgtgtcggattccgcggaat 8930 92 1675 DNA Homo sapiens 92 gtgagacaga gacaaatgaacccccctcta aagtcattta actaatagcc agcacatccc 60 ttccccaaac tgtcaattgaaatcttaact gaaagtttta ctgaataata ccaagctaat 120 tgctgttggg cacacctggatggctttgca cctggtgttg aacctgctga agcaggtgga 180 tgctcaagat tacgtgcaaggaatccctcc catctggtac taaaatttca gtgtgttctg 240 agtgtctttt aaaccaaaatggaaatacag atacagggct gtagtattca gtaatgtgtc 300 tgctccttgt tgggcagacaccagcggtgt gcagggagag accaagtacc atctttatct 360 acacttgggc tggcttgtggagaagggctg ctttttttca gtcctacatt ccttcatttt 420 ttttttcatt cttgaattcattgttttgtg ggatctaaga cccaggggtc atttgagagg 480 tttgacagta tcttttctgaccagttgcca catgacttgc ttgaccctga gcctgtggaa 540 atggcatagg gaccagtctactacccactg ggcctggtgt gtagaggggg agagggtagc 600 aaggtgcttc tctacgcccatgacttggga gcaggtcttg gcctccttca tgagagtcta 660 gtgccatgtc ctgtcccatgatctggaccc tgggactgtc ttggcatctt aactgcagtt 720 tcaatgaggc agagggcaaagagagaccaa gatcagaggg gttcattata cccctggcta 780 gagaacccag ctactgacatgcaagcagct tggggctggc tggacacagg tactaggccc 840 attgtttcca ggtgaagctttcatcacaga acagtgttgt ctccacctgg ccttagatgg 900 cacgccatga ttcgggcctggatagactgc ctgcgtcctt accactgatc tggccaagaa 960 tgaggccctc ccaacactttcactccctct ccaagccttg atgggacctc cacttattta 1020 ggcctcatgt gctttgaagaagctttgaga gccaatgtgt cttccacggg tctctttttt 1080 gctacaagta atcagccccatgtgttctct taaactgaga attgcacctg ggcaattcct 1140 gttttctaag gtggtctctgctgctattta acaacccaga gtaggcctct gtgaggcttc 1200 agtggcctca gaaaccagagggtccagata gggggcctgc ttgggccctc tgctgccaac 1260 tgctcaaacc tgctttagctccagccactt gtggcaaaca acctcgtttc cttacaaatt 1320 ccagcatgtg actttggtgccgttacttgt gaaaaatcta ttctgttgtc tttgatgtgt 1380 ccaagaaaat tcgtgtagtttacgtaaaaa tatctgactc acaagaaagc caactgtatg 1440 tcttgtgatg ggacagttcataatgtagtt gctagaccac tttacaaatt gttcttgtca 1500 ccagatgtgt tcagacattgctgtgcaatt gttggggagg gtagggggaa aggcgagagg 1560 agatacttat tggtctttttgtttaatacc ttccccaaga ggggacagtc tggccaactt 1620 gctccagtaa tgcaataaagacattgcaat aaagtaaaaa aaaaaaaaaa aaaaa 1675 93 4180 DNA Homo sapiens 93ccagggtgat gctgaagatg atgaccttct tccaaggcct ctagagccat cagcctgtgc 60caggcaccct cgacttgcct agaggccccc aaaagttgca gtccacatca gaggcagagt 120cagaggcctc catgtcggag gcctcctctg aggacctggt gccacccctg gaggctgggg 180cagccccata tagggaggag gaagaggcgg cgaagaagaa gaaggagaag aagaagaagt 240ccaaaggcct ggccaatgtg ttctgcgtct tcaccaaagg gaagaagaag aagggtcagc 300ccagctcagc ggagcccgag gacgcagccg ggtccaggca ggggctggat ggcccgcccc 360ccacagtgga ggagctgaag gcggcgctgg agcgcgggca gctggaggcg gcgcggccgc 420tgctggcgct ggagcgggag ctggcggcgg cggcggcggc gggcggtgtg agcgaggagg 480agctggtgcg gcgccagagc aaggtggagg cgctgtacga gctgctgcgc gaccaggtgc 540tgggcgtgct gcggcggccg ctggaggcgc cgcccgagcg gctgcgccag gcgctggccg 600tggtggcgga gcaggagcgc gaggaccgcc aggcggcggc ggcggggccg gggacctcgg 660ggctggcggc cacgcgcccg cggcgctggc tgcagctgtg gcggcgcggc gtggcggagg 720cggccgagga gcgcatgggc cagcggccgg ccgcgggcgc cgaggtcccc gagagcgtct 780ttctgcactt gggccgcacc atgaaggagg acctggaggc cgtggtggag cggctgaagc 840cgctgttccc cgccgagttc ggcgtcgtgg cggcctacgc cgagagctac caccagcact 900tcgcggccca cctggccgcc gtggcgcagt tcgagctgtg cgagcgcgac acctacatgc 960tgctgctctg ggtggagaac ctctacccca atgacatcat caacagcccc aagctggtgg 1020gtgagctgca gggtatgggg ctcgggagcc tcctgccccc caggcagatc cgactgctgg 1080aggccacatt cctgtccagt gaggcggcca atgtgaggga gttgatggac cgagctctgg 1140agctagaggc acggcgctgg gctgaggatg tgcctcccca gaggctggac ggccactgcc 1200acagcgagct ggccatcgac atcatccaga tcacctccca ggcccaggcc aaggccgaga 1260gcatcacgct ggacttgggc tcacagataa agcgggtgct gctggtggag ctgcctgcgt 1320tcctgaggag ctaccagcgc gcctttaatg aatttctgga gagaggcaag cagctgacga 1380attacagggc caatgttatt gccaacatca acaactgcct gtccttccgg atgtccatgg 1440agcagaattg gcaggtaccc caggacaccc tgagcctcct gctgggcccc ctgggtgagc 1500tcaagagcca cggctttgac accctgctcc agaacctgca tgaggacctg aagccactgt 1560tcaagaggtt cacgcacacc cgctgggcgg cccctgtgga gaccctggaa aacatcatcg 1620ccactgtaga cacgaggctg cctgagttct cagagctgca gggctgtttc cgggaggagc 1680tcatggaggc cttgcacctg cacctggtga aggagtacat catccaactc agcaaggggc 1740gcctggtcct caagacggcc gagcagcagc agcagctggc tgggtacatc ctggccaatg 1800ctgacaccat ccagcacttc tgcacccagc acggctcccc ggcgacctgg ctgcagcctg 1860ctctccctac gctggccgag atcattcgcc tgcaggaccc cagtgccatc aagattgagg 1920tggccactta tgccacctgc taccctgact tcagcaaagg ccacctgagc gctatcctgg 1980ccatcaaggg gaacctatcc aacagtgagg tcaagcgcat ccggagcatc ttggacgtca 2040gcatgggggc gcaggagccc tcccggcccc tattttccct tataaaggtt ggttagcttt 2100tcctgtggcc tgacctgcct gtgagtgccc agcaagcctt gggcacaccc cgctgggagc 2160tgttaagagc agcgctggtt ctcggttcct cccgggtctc ctgtgctctg atgctacttc 2220tgcctagccc tggcggaggt gcaggccctg tcagctggaa ctggacagac cttggtttgt 2280ttacatgtcc gatgggggca ggagctccca tcctgggcag ccaaccaggc aacaccaagg 2340actctttgta aacgatagct gatcgtgtgc acgcaaggaa agaaccagga gggagagtgc 2400agccaggctc agggatcccc ggacacctct gtccagagcc cctccacagt cggcctcatg 2460actgtcctcc tcgtgggtgg ggccgagggc cctcttcagc tctctggaga caggggccga 2520gcctcaccca tctgccctct gcagcccagg gccgccgtga gcgggattca gcaatggtgg 2580aatggaagac agaactggaa gagaaagaag gaaaagatga gctctcgtct ggcaggggct 2640tttagggtcc tgtggcgagc tgtgagcacc gccagcgtta gacgtcacat ccaggtggcc 2700ccacggcccc tacaggctgg ccctgcaatg gggccctgag ccctccctct tcatccccca 2760aggcctcaac tagagggtgg tcccccgagg gcttggtgtc tactaccgaa gggcccaaga 2820cctcctgggt cctctcaggc tcccccttcc ccaaggcagg gacaggccct gggggtgcca 2880ccgtgggccc tgccacccag aagtctggct gaggtctggg caggggcagg gcaagcttga 2940cctctcactg ttgacccttt ggcctctgta tttgtttcct attgccgtga caggtttcca 3000caaacttcgt ggatcaaaac gaggtcttcc agttctgcgg gtcagaaggc tgacctgggg 3060ctcaaatctg ggtgtcggca gtcctgcact ccttctggag gctctagggg agaattcatt 3120tctggccttt tcatttttag aggctgaccg taattcttga cttcaggctc ctccatcttc 3180agagccagct gtgggtagtt gaatcttttt cccgtcacct cattgaggcc tcccctctcc 3240tgcctccctc caccactttt tttttttttt ttttgagaca gggtcttgct gtgttgccca 3300ggctggagtg cagtggcctg gtcatggcat caaggctcac tgcagcctgg acctcctggt 3360tcaagtgatc ctcttgtctc agtcccctga gacaatcccc cacgcccagc tacatatttt 3420tgtggataca gggtctcatt ctgttgccta ggcttgtctg gaactcctgg gctcaaggga 3480tcttgtagcc ttagcctcct aaagtgctgg gattataggc atgagtcact cgtacccggc 3540ctgctctacc gcttttaagg acgcttatga tcacattgcg cctacccaga gaacccaggt 3600cgtctttcta ttttcaggtc agctgattag ccaccttagt tccatctgca actttagttc 3660ccactggctg tgtaacctaa catagtcaca ggctctgggg actgtcacgt ggacatcttt 3720gggaggccgt tattctgccc accgcaccct ccgttcatcc cctgccctgc cgggcacctc 3780gctctacccc aggaaaatgt gagctcgttt tcctgctcgg catgtgctcc ccctaaggct 3840ctgctcctcc ctgggcctga aagttccttc tcagcctgag agggggccct tcgatctcag 3900gcatgactca gcccggctga tgcctctgca gtgctgagtc aggatttggg gccggctctc 3960ttgggtctgt ccccttttcc caggtactgc cttacaaagc tgtggccagg aagtggccgg 4020tataaaggat gcccaaggtc tttgtacgtg tgtaggagtt agcgtgtttg atattgttaa 4080tataataata attatttttt agagtactgc ttttgtatgt atgttgaaca ggatccaggt 4140ttttatagct tgatataaaa cagaattcaa aagtgaaaaa 4180 94 1897 DNA Homosapiens 94 gacgagagaa agcgagtgtc cctctcgcgc cccaggccgg tgtacccccgcactccgcgc 60 cccggcctag aagctctctc tccccgctcc ccggcccggc ccccgccccgccccgcccca 120 gcccgctggc gccatggagc gctggccttg gccgtcgggc ggcgcctggctgctcgtggc 180 tgcccgcgcg ctgctgcagc tgctgcgctc agacctgcgt ctgggccgcccgctgctggc 240 ggcgctggcg ctgctggccg cgctcgactg gctgtgccag cgcctgctgcccccgccggc 300 cgcactcgcc gtgctggccg ccgccggctg gatcgcgttg tcccgcctggcgcgcccgca 360 gcgcctgccg gtggccactc gcgcggtgct catcaccggc tgtgactctggttttggcaa 420 ggagacggcc aagaaactgg actccatggg cttcacggtg ctggccaccgtattggagtt 480 gaacagcccc ggtgccatcg agctgcgtac ctgctgctcc cctcgcctaaggctgctgca 540 gatggacctg accaaaccag gagacattag ccgcgtgcta gagttcaccaaggcccacac 600 caccagcacc ggcctgtggg gcctcgtcaa caacgcaggc cacaatgaagtagttgctga 660 tgcggagctg tctccagtgg ccactttccg tagctgcatg gaggtgaatttctttggcgc 720 gctcgagctg accaagggcc tcctgcccct gctgcgcagc tcaaggggccgcatcgtgac 780 tgtggggagc ccagcggggg acatgccata tccgtgcttg ggggcctatggaacctccaa 840 agcggccgtg gcgctactca tggacacatt cagctgtgaa ctccttccctggggggtcaa 900 ggtcagcatc atccagcctg gctgcttcaa gacagagtca gtgagaaacgtgggtcagtg 960 ggaaaagcgc aagcaattgc tgctggccaa cctgcctcaa gagctgctgcaggcctacgg 1020 caaggactac atcgagcact tgcatgggca gttcctgcac tcgctacgcctggccatgtc 1080 cgacctcacc ccagttgtag atgccatcac agatgcgctg ctggcagctcggccccgccg 1140 ccgctattac cccggccagg gcctggggct catgtacttc atccactactacctgcctga 1200 aggcctgcgg cgccgcttcc tgcaggcctt cttcatcagt cactgtctgcctcgagcact 1260 gcagcctggc cagcctggca ctaccccacc acaggacgca gcccaggacccaaacctgag 1320 ccccggccct tccccagcag tggctcggtg agccatgtgc acctatggcccagccactgc 1380 agcacaggag gctccgtgag cccttggttc ctccccgaaa acccccagcattacgatccc 1440 ccaagtgtcc tggaccctgg cctaaagaat cccaccccca cttcatgcccactgccgatg 1500 cccaatccag gcccggtgag gccaaggttt cccagtgagc ctctgcgcctctccactgtt 1560 tcatgagccc aaacaccctc ctggcacaac gctctaccct gcagcttggagaactccgct 1620 ggatggggag tctcatgcaa gacttcactg cagcctttca caggactctgcagatagtgc 1680 ctctgcaaac taaggagtga ctaggtgggt tggggacccc ctcaggattgtttctcggca 1740 ccagtgcctc agtgctgcaa ttgagggcta aatcccaagt gtctcttgactggctcaaga 1800 attagggccc caactacaca cccccaagcc acagggaagc atgtactgtacttcccaatt 1860 gccacatttt aaataaagac aaatttttat ttcttct 1897 95 2291DNA Homo sapiens 95 gaacaatgaa gaaagcccca cagccactgt tgctgagcagggagaggata ttacctccaa 60 aaaagacagg ggagtattaa agattgtcaa aagagtggggaatggtgagg aaacgccgat 120 gattggagac aaagtttatg tccattacaa aggaaaattgtcaaatggaa agaagtttga 180 ttccagtcat gatagaaatg aaccatttgt ctttagtcttggcaaaggcc aagtcatcaa 240 ggcatgggac attggggtgg ctaccatgaa gaaaggagagatatgccatt tactgtgcaa 300 accagaatat gcatatggct cggctggcag tctccctaaaattccctcga atgcaactct 360 cttttttgag attgagctcc ttgatttcaa aggagaggatttatttgaag atggaggcat 420 tatccggaga accaaacgga aaggagaggg atattcaaatccaaacgaag gagcaacagt 480 agaaatccac ctggaaggcc gctgtggtgg aaggatgtttgactgcagag atgtggcatt 540 cactgtgggc gaaggagaag accacgacat tccaattggaattgacaaag ctctggagaa 600 aatgcagcgg gaagaacaat gtattttata tcttggaccaagatatggtt ttggagaggc 660 agggaagcct aaatttggca ttgaacctaa tgctgagcttatatatgaag ttacacttaa 720 gagcttcgaa aaggccaaag aatcctggga gatggataccaaagaaaaat tggagcaggc 780 tgccattgtc aaagagaagg gaaccgtata cttcaagggaggcaaataca tgcaggcggt 840 gattcagtat gggaagatag tgtcctggtt agagatggaatatggtttat cagaaaagga 900 atcgaaagct tctgaatcat ttctccttgc tgcctttctgaacctggcca tgtgctacct 960 gaagcttaga gaatacacca aagctgttga atgctgtgacaaggcccttg gactggacag 1020 tgccaatgag aaaggcttgt ataggagggg tgaagcccagctgctcatga acgagtttga 1080 gtcagccaag ggtgactttg agaaagtgct ggaagtaaacccccagaata aggctgcaag 1140 actgcagatc tccatgtgcc agaaaaaggc caaggagcacaacgagcggg accgcaggat 1200 atacgccaac atgttcaaga agtttgcaga gcaggatgccaaggaagagg ccaataaagc 1260 aatgggcaag aagacttcag aaggggtcac taatgaaaaaggaacagaca gtcaagcaat 1320 ggaagaagag aaacctgagg gccacgtatg acgccacgccaaggagggaa gagtcccagt 1380 gaactcggcc cctcctcaat gggctttccc ccaactcaggacagaacagt gtttaatgta 1440 aagtttgtta tagtctatgt gattctggaa gcaaatggcaaaaccagtag cttcccaaaa 1500 acagcccccc tgctgctgcc cggagggttc actgaggggtggcacgggac cactccaggt 1560 ggaacaaaca gaaatgactg tggtgtggag ggagtgagccagcagcttaa gtccagctca 1620 tttcagtttc tatcaacctt caagtatcca attcagggtccctggagatc atcctaacaa 1680 tgtggggctg ttaggtttta cctttgaact ttcatagcactgcagaaacc ttttaaaaaa 1740 aaatgcttca tgaatttctc ctttcctaca gttgggtagggtaggggaag gaggataagc 1800 ttttgttttt taaatgactg aagtgctata aatgtagtctgttgcatttt taaccaacag 1860 aacccacagt agaggggtct catgtctccc cagttccacagcagtgtcac agacgtgaaa 1920 gccagaacct cagaggccac ttgcttgctg acttagcctcctcccaaagt ccccctcctc 1980 agccagcctc cttgtgagag tggctttcta ccacacacagcctgtccctg ggggagtaat 2040 tctgtcattc ctaaaacacc cttcagcaat gataatgagcagatgagagt ttctggatta 2100 gcttttccta ttttcgatga agttctgaga tactgaaatgtgaaaagagc aatcagaatt 2160 gtgctttttc tcccctcctc tattcctttt agggaataatattcaataca cagtacttcc 2220 tcccagaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaaaaaaaaaaaa aaaaaaaaaa 2280 aaaaaaaaaa a 2291 96 15571 DNA Homo sapiens96 aagcttcctc actccttggc acctggctcc gacatcacat tgacttttcc cttcctgctt 60ctaccatcac atcaccttct tctgactcca atctcctgcc tctttcttgc aaggatcctt 120gtgattgtaa ttaggaccca gctggataat ccatgacaat ctcttcaaga tccttaactt 180aatcacatct gcaaagtccc ttttgtcata gaacgataac attcacaggt tctgggtatt 240aggacacgga taacttcggg gttccattac tcacccataa ctggtatgca gtgctgattt 300ccatcctgta ggtacggttt agggatctct aggtcaatga gataatggac tcttgctcat 360gttacatggc ataatgggaa gaaagccaaa cctagaaaaa gagggactca ggttcccttg 420ttagagcctc ttcttactaa ctgtgggatt aggggctgat tccctgacct gctgtgttct 480gtcttctctc caattcaatg ggaatgaact gtgagggcac tgagcaaaaa ctaaggtctc 540aatacctagt agtagtggga cttgcctctg gatacccagt agtgatcctg ccgcctgttt 600ctggatatcc tacagtagca atcccacctg tttctggata tcctacagta gtgatcctgc 660ctgtttctgg atatcctaca gtagcaatcc cgcctgtttc tggatatcct acagtagtga 720tcctgcctgc ttctggatac ccagaagtga tcatgcctgg ttctagatac ccagtagtga 780ttgtgtttcc tctagatacc cagtaggtat tgtgcttgct tctagagatg tagtagtagc 840tggacttagc tctagatacc cagtgctcat ctctagactg ggctgagatc agtgtctccc 900ttgaagggtt attgtaagga tgaaaaaaga taatgcgttt aaagcacttg gtgtagtagg 960tggtcttttt aaaagtgtga ataaatacta gttcttatta tttctgtgga tatccaacag 1020ccacataatt gggccccaaa gccatgaaga aggaagagga aatgtcttaa aggttgtcga 1080tggacagtgt ttgctgaaca tcaaaatcac tttccaggta ttacctctga tttgctctac 1140caactccaca ccccacctgc agccacataa ccttccatga tcacggccat gcacaacaca 1200ccatgtcccc caggcaaggg gaccttagaa acataaccag gcttgagaca gcactctgca 1260ccggtgtctt ggaaatgctc ttaagagtgt atggctgagt tagggaacca ggatttcaaa 1320gtagaaaggg agaatctacc caagcccata gaaatcctga atccactcct ttctcagcaa 1380caagcactgg cctgggagtc agccacttat gcaccaaccc cactctgccc ctaattaaat 1440gcatgacttt gaaaattccc ctcattcttc tgagccccaa ttcagtgatt ggtgcaatca 1500caggcttggc tacagtgacc cattcattgc aggcatggtg agactctcaa tccctctcat 1560ttccactaga atctaactgt tgggatctat gacccagtca gcatagcagg cctgtgggga 1620gctctcaggt tcaagcatat gcccccccta atctacaaga aattagctgc agaaaaccaa 1680ggaatagaac ctggaaaaag agagggtttg ctagagctgt ccctttccct gtctctggaa 1740tgccaacaat agggaggctc tttggtcttg tctctcagga gtgcccatgc cattccagga 1800aaatgatggc ccagctggtg gtgtaaggct tggggggcag cgagtgggca tcgtggtgaa 1860agcctcggga tcagggagct gcgtctgcag gcaggcctgc tggccggaaa cctgccagga 1920aaggaagggg ctgtctcggg gcggggccag ggaggggtgg agacagggcc ggctgtggtc 1980agtgacaaat gctggctgca atccagccag ccctctgccc tttctgagcc cgagggactg 2040ccacctccac tgtgtgcaca ctcagctacg ggacacagta agtaccgatg ccgcaaaggg 2100aggtccccag ggcttgaggg catgtgaggc gaggagagga tggactctag agttttgggg 2160tttggggtct gcaaagctct gaaggagtct catctctgca gtttcaggta tccaaggcag 2220cagaggtgag tgggtccccc gagctctgtg accttatgct ccacactaac tctggcagag 2280cctccgtttc ctcataggta agatggaaat aattacaccc tctggatggt gtgactgaag 2340attaaataca gcgggtgctc tcactcagca catctggcca tgtctgcaga cacatttggt 2400tgccacaact ggaagggggg tgggggttag tgacatctag aggccagcga tgctgctgat 2460gatcccacaa tgcccaggac aagatcacaa agcatcatcc tgttcaaaag gtcaacagga 2520tcaaggttga gagaccctga aataaggcca tggggacaaa atgtcggctg gataggaggt 2580gctcagtaag tggcagcttc tgttgttttc tgtgcctgga gtcttggggc tttagaaatc 2640aggaacaatg atccaatatt atcggcttcc gtgagataag ggcatcttgc ctggaggctg 2700ccacccaggc cggtcatggc agctgctcat gaaggacagt aacaatttgg cagtttgtta 2760aatgaacaaa atgtagaaat aaagtaagca gaatttttag tttttctgaa ggtagggctt 2820ttggccagat atgcagcaat aaaagagcaa actgcttcct tgggccagtg tccttgctca 2880tagatcagga aaccgaagca tgaagaatac aggcggcaga tgcctgaagg taacggacgt 2940gttcatggtg ctgacggtga tgataagtga cagatgtaga ctcatctcca aacttgtcag 3000gttatagaca ttaaatatgt gcaactttat gaatagcagt catgtctcaa tcaagtggtt 3060ttaataaaga aataatagga agccagagct gagagacagg gagggagttg ttcaaggtca 3120cctggcaagt gagctccggg gcggggagag ctcagctctg ggtggccagc ctggcttttt 3180ccactgctca gtgtccagct tgcagtctaa tgtctcgaat tacagagaag gagactggtc 3240agttcattca ttcattcatt ctacaaaggt ttatggagca tctctcctga ctgcaagctc 3300ttgaaggtga gagcagcaca aatgagggtc ccatggagag agaggccgga atgaaaaatg 3360tcaatgacaa atgcatatat aaaggcacat gtgtaattga aagagctttg agagaaagag 3420tcaagggact gttccagaga atagccatgg aaggggaaaa ggtccagtgt gataaggtat 3480tgcaaagaag tgacatttaa gcaaaagcct gcagcctatg cagaagttgg cctcagtgag 3540aaaggttggg ggagggttcc agtagagagg gaaggtatgc aaaggcccag agttaggaca 3600gaacttgctg tgtttgagaa actgggaaaa gaagagtgag cctgggggta tcacgtgatc 3660cagggcagag caggtccagg ccaggtgcag ccaggtcaca gcagccctag tgggttagag 3720cacaaatcaa agtttagcat ttatctgaaa cacaggagtt ggccatgagt ttcttaggcg 3780aggaagcgct gtgaccatat ttatgattga aggagattct tttatatgct gtatatagaa 3840agcctttcag ggcaaagaaa ggaagctact ggggtagccc tgggggagat gaagggagct 3900tccactgggg gcagtaagaa agccagggaa aggcggcagc tttaagacct gttttggaga 3960tagaacggac aagctttgct gatgggctgg agtggaacag gaagtcaaga ttacttcttc 4020tgggaagttc tgttcctggg tctttaggat ctagaggaag ctgtgacttt gtctctcatc 4080tctgcctggg ctccaagcct cacatccctt tttgtaatta gaagatattg gacagaccgt 4140cctcactaac acaattccca cagctgagtc cagggtagaa ctgggcagga cttcactgcc 4200caacacggga aatatcagtc agcagatttg ggtttcgggg atggtggtgg gccagcggga 4260agactgacca gggcctaccc atcacatccc caccacctcc cacctcaatt caccttggcc 4320tgagatgaca ggtgaacatg actgatcctc tctcttccct ctgcagaaac actaaagcca 4380gggaccagga gaggggcagc ccaaccaagc tttcaaagca ctcagtagag gctggtctgg 4440gggatgggag gctcccaggg cttcacctgt ctctgtcaaa gccatgtatt tccaccagag 4500gcccaagagt gcgatggcaa accctggatt tgaaactaag aaacgtaaaa caagcactga 4560ggactccact gcctcttgag tgacctctct gaccctctgt ttcttctgca ctgttaggat 4620aatgatacta actccatgtt gttgtagaga agtataaatg agctaataca ggtgaaccgc 4680ctggggatac caggaggtga ggtcgaggag gaacgaggta tcactcctca gagccactca 4740gagagaggct gtgcacgagt cagaggaacc tggattttaa ttccggttcc atcactcagt 4800agctgaaaca agctattcca cttcacttag cctcagtcta ttcaatctgt aaaatagagt 4860gagtttactt ttggaaaact ctgtaaaata gagagcttac ttttggtgaa ggttaaacat 4920agtaatattt atggagtgtc tagtatgtct ttaataatta gtggttttac tgaaaagtag 4980agagagttgg cccagaggga gcaagatttc tgggtctcaa acatgtagcc caggagagcc 5040taagtgaacc tggggccctc tccaaacaga tcctggggga gactcagtgc acacccggag 5100aagcagctcc tccccatcgg atctctagtg cttggcaggg ggcggggtct tgagggggtg 5160tccacaacac atggcagact gcagatgaag aaactgaggc ccagaggggg tgaggcttgc 5220ccagggtgac ctagtagctg aatagatggg agaatggagc cagggcctca ctgagactct 5280ctggtcagct gcccctgggc tgtatccaat aaggaaactc ccctgcttct gaagctgttc 5340tcgaaattat cagctcagtg tgaccctgtg gggggttgag ccacattgtt tctttagaag 5400catctccata catggctggt tccaaccctt ggcaggaggg accatattgt gctgtaaaat 5460agactcattt agagaagccg gagattaaag cacccaccta tgtccttcaa agctctccag 5520gcaagtgcca tggtgggaac aggtagggag tgtcagtggg gggaagccca gactctgctc 5580actcattatc tgcagattag ggctattgtt ggtggctact aagtcaggga tttcaaaatc 5640aggaagatgc agccaggaaa agaggaggca ggactctgca gaggaggcag gactctgcag 5700agtcagagtg ataaccgagt ctgagtccaa gctttgccag tgttagcaag cgactccatc 5760tctctgaacc tcggattacc catctgtaaa atagagctag cagcaagatg tacctttttg 5820ggtggtgcag ggctgaagga gttggcacag tgcctgaaag agggtgcggg caatgcgccc 5880aactgctgtg gctgctgggt ttggtgccag gttcgattct gcaggcagaa acttctacat 5940gaggctcctt ctcggaagga gctcaggaca caatttggag gctgggctgg caagggtgac 6000ctgctggagc tattcaactt cacttaaaga caggcctgca gtccaagcct gcccaattcc 6060tgagaccatt ctctctccac tgctgagccc cacggccact ctgcaaggga tttcccaccc 6120acctgtttgg ggccctttgg agtttggttt taattgggtc acgggatgct gtgacaggct 6180gcccctgcct ggtggggatc tggggtcact gatgacattg tgcccatgga gagagcccag 6240cagaaaggga ttccctccaa ggcgacacac agggcaaagc tcacatcaga agccaggcag 6300gccctctgca cctggtaatt agccggcccg ggtgctgtca ggctcacacg tgtgtgtgtg 6360tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg taaagcatgt accctatggt acagttgaga 6420atatggaggc ctcagatggg gcttttgcag aaactgccat gcctactgct cacacttcca 6480tagcacgtgc ccccaagcac cccatggtgt aggtgctgtt attatcacta tcttacagtt 6540atggagcagt ggctcaaggt gtaactgatt tgcccaaaat cacactacaa ggacacagca 6600gggctgagat ttgaacccag gcagtggctt cagagcctga gctgtttcct actgcagagg 6660gaggaggcaa gacttctacc cgtagccaga tggggaggca tgggcacagg aacggctctt 6720gggtgaagtg gagggaggaa gaggaggact gaaggcgaag gccacgtcag gagtgatggg 6780ataccccaca aaggcctccc tgagaagcgc tagagacaaa gatgagtgcc tcctcatctg 6840gaagatgaaa agatgtcttt gcctgcatgg gctgccgtca caaagtccca ggggctaggg 6900ggcttcaaca acagaaattt ctttctttac aactctggaa gctggaagtc tgagattaag 6960gcaccagcag gatttgttcc ttccaaggcc cctctccttg gctcacaggt ggctgccttc 7020tccctgtctt cacctggtct tccctctgtg catgtctcta tcctgatctc ctctttttaa 7080tttttgtgta aggacgtagt catattgggt tggggcccac tctagtgacc tcattctaac 7140tcagtcccct ctttaaaagc cctatctcca gatatagtca cattctgggg tattgaaggt 7200aaggacttca gtatatgcat tttgggggca caattcagcc agaacaggag gacggtgggg 7260atgtccacat gaagaggttc aggcagaatt cctttaggag gggaagatgt ctctctgtgg 7320gacaagggtg gcatggagca gcccctgggg gaaggagaag gggacagttt gcatactggt 7380attctgccta ccccagggtg gacactcact cagcgtttgc tgaatgaaca gggcaaggcc 7440agcagtgctg atggtcccag gcatgtagct ggtctgagtt catagaagga ccacagcgcc 7500ctgccatgtg ccaaaccagg acaccagagt gaaggccaga agctcacatg gaagcagctt 7560agttccctgg taacctcgag atgctgatga gacagagcag agcagaggga accctctccc 7620tccatatccc atcctccaaa atgtgtccct tgatgtggat gggtagacag gattcctgcc 7680ctggcagcca gacccctgcc ttgggtctgc acctcctctc cctccttcct ctccccgtca 7740tccctaaatc ttgtcctcga gccactgcca ccctgtgtaa accctcatgt ccagtcttgg 7800gggtgccatc ccttctcttt aaagctgaat ggaccaaaca tacccattga gtgttgggtg 7860gggacatctc tggaaagtca gcacctggac cagctccacc cctctctgag gacaccttct 7920ttccctttca gaacaaagaa cagccaccat gcagctcttc ctcctcttgt gcctggtgct 7980tctcagccct cagggggcct cccttcaccg ccaccacccc cgggagatga agaagagagt 8040cgaggacctc catgtaggtg ccacggtggc ccccagcagc agaagggact ttacctttga 8100cctctacagg gccttggctt ccgctgcccc cagccagaac atcttcttct cccctgtgag 8160catctccatg agcctggcca tgctctccct gggggctggg tccagcacaa agatgcagat 8220cctggagggc ctgggcctca acctccagaa aagctcagag aaggagctgc acagaggctt 8280tcagcagctc cttcaggaac tcaaccagcc cagagatggc ttccagctga gcctcggcaa 8340tgcccttttc accgacctgg tggtagacct gcaggacacc ttcgtaagtg ccatgaagac 8400gctgtacctg gcagacactt tccccaccaa ctttagggac tctgcagggg ccatgaagca 8460gatcaatgat tatgtggcaa agcaaacgaa gggcaagatt gtggacttgc ttaagaacct 8520cgatagcaat gcggtcgtga tcatggtgaa ttacatcttc tttaaaggta aggcccttgg 8580gcccaaacct gcactttctt tggcttttct gctgctttta tctaaagaat acccaattcc 8640ctcacataca taaaagacgg ggagtacgtt aagttctttt gggtgcctgt tgagaaaaat 8700taagtaaaca agcagccaga gaaggtaaga tgaatgcctt cttgctgtgg atgggattag 8760tgaggctgag atgctgtttc ctccacggag gaagagctgg ttgctgtctt cgggcccctg 8820gggacatctg aagccccagc tttctacagg ctctgaagta tgaacccatt gtggccacca 8880tggcaaagac accaacacct tagccactca gggcaggaca cagaccccag aagggcttaa 8940agggcatttc ccagtccccc gtatccctca gatcttggcc cctctgccct catagaggcc 9000aagactccct cagacaaatg cttgttcctc tgaaatgcct cctcctgact cctcagcaag 9060agctgacctc tgcttatctc cccgacactc cttgtaagca ttcctgctcg cctctgcagc 9120tcctgccagt tgctgaccct ggggaaagca agagtggata gagaggagaa gagaggagag 9180gagagggtgg gaagggttgc gaaggaaggt aaattgttaa cacctcccct tcctatggtc 9240acagatcatg agtatctttg gccatttggg tggctataac aaaataccat aaactgggtg 9300gcttagcaac aacaaacata tatttctcat agttctggag gctgagaagt ccaggatcaa 9360ggcactggca gatgcagtgc ccattccttg gttcatagag agtgccttct tagtatatcc 9420ttgctggaag gaggaaggca gctctctgtg gtctcttttg taaggacacc gatcctgttc 9480atgacagctc cacccccatg acctaatcaa ctcccaaagg ccccctgtcc taataccacc 9540accttggggg ttaggtttca acatatgaac aatgtgggga cacaaacatt gagaccacag 9600cagtgagtgt cgaacttgga ctctgagatt tcctatcccc tggtgcaggg cagtccccat 9660tacaccagat tgctgagggc agctgggaaa taagctaagg acggtattga ctggggtctt 9720ccttcgataa cgattaagaa gttggaaaca ggccaggcat ggtggctcac gcctataatc 9780ccaacatttt aggaggccga gatgggcaga tcacctgagg tcaggagttc gagatcagcc 9840tggccaacat agtgaaaccc cgtctctact aaaaaataca gaaattagcc aggcatggtg 9900gtgggcgcct gtaattccag ctacttggga ggctgaggca ggagaatcac ttgaacctgg 9960gaggtggggg ctgtagtgag ccaaaattgc gccactgcac tccagcatgg gtcacagagc 10020gagactccat ctcaaaaaga agaaaaaaag aaaaaaaaga aaaaaagaaa taaaataaaa 10080taaaaagaag ttggaaacaa tcacttgtag cgttttgttc agaagttccc ataggaaggt 10140cagagaaggg tcattgaaga cttcccaatg ggaaaaacca ttcatttcca ggatccatac 10200taacttcttt ctaaaattta aatcaaaata ttggaatgaa agtgcaaaca gagaagttca 10260cccagatatc aggtagcatt cacagccagc cacatttttc accctcttca cttggagatt 10320tggtcttgag taaaacgtta gagaatcaga gaacatcagg gatccagggc ctctgaagat 10380gtgaaaacca acctccttgt tttgcaaatg tggaaggaaa agtcccacga aaagtccaag 10440aatgtgccca atgttataaa gagacttgcc ttcatattca agaggttcaa cagtcactgc 10500tctggggctg ccataaagat ggtctccgct ggctatcttt actgtcttca ctccttttat 10560ttgcagctga gaatttctaa ttctgacaca aaattctttt tcatttttcc cttttttcat 10620ctttagctaa gtgggagaca agcttcaacc acaaaggcac ccaagagcaa gacttctacg 10680tgacctcgga gactgtggtg cgggtaccca tgatgagccg cgaggatcag tatcactacc 10740tcctggaccg gaacctctcc tgcagggtgg tgggggtccc ctaccaaggc aatgccacgg 10800ctttgttcat tctccccagt gagggaaaga tgcagcaggt ggagaatgga ctgagtgaga 10860aaacgctgag gaagtggctt aagatgttca aaaagaggta ctttcagact accccagggc 10920cagcctaaac ccacacagcc ccagggagac acacacgccc taccagggcc acacagcact 10980ggtgggaagg actcacccag ccaaggagct gcctccaggc ccagaggcat cctgtgacat 11040ccaagtcctg ggggcctagc ccagttggag ggacaagagc tggaaactgg gttccttagg 11100gtggtgccag agtgggcaga gacctctggg cagcccacgt ccaagtccag agcaagggga 11160ggctcatcct agaaaagagg ccagaggagc cataaccacc attgttcctt gggttaagga 11220gtcctttttt aaaaccatca aaactaagaa tccagtgcat tatgaatcca aggggtgagg 11280ctcagtgtgc caatgcccca gaacagtcta agaaagctcc ttttcccttt ccaggcagct 11340cgagctttac cttcccaaat tctccattga gggctcctat cagctggaga aagtcctccc 11400cagtctgggg atcagtaacg tcttcacctc ccatgctgat ctgtccggca tcagcaacca 11460ctcaaatatc caggtgtctg aggtgggttc agaagctcct atgcatctgc ttcccaagat 11520ctattctgtt ctattctttc tattctactc taccccattt cattccattc cattccactc 11580aactccactc cactccactc cactccagtt cactctattc aattccactc cactccagtt 11640cactctattc aattccactc cactccactc cagttcactc tattcagttc cactccactc 11700cactccactc cactccagtt cactctattc cattccactc cattccactc ctccactcct 11760ctcatccact ccactctact cctccactcc acatctccac tccactcctc cactccactc 11820ctccactcca ctcatccact ccactcctcc actccactcc tccactccac tcctccactc 11880cactccactc atccactcca ctcttccatt ccactccatt ccactcctcc actccactct 11940tccactccac tccattccac tcctccactc cactccactc tattctattc tattccattc 12000cattctactc tattctattc cattccattg cagtcaactc cactccactc tctactattc 12060tattccactc ctctcccctc cactccattc cattgcagtc cactccactg cactccactc 12120ctttattctg ttctgttcta ttctattcta ttctattcta ttctctccct ctccctctct 12180tttcccacaa gtagtgaaag tttcactttg tgtcttatcc ttcatgtaat gggaagccat 12240atccaccact gttccttgag ttaaggagtc ctgttttaaa caatcaaaac taagaaggca 12300cttcctagct atgtgatctc caaaaaatac ttgactctct gagcttcctt tctctcttct 12360ataaaattga agaattacac cttgctcaaa gatgccatga gaattcaatg acagacacat 12420gcgaagtcac cccccagcac agtgcctggg gcagagtagc tgctccattg ttccatttcc 12480tacttgctcc atggctcagt tgaacagata cttagaggtt gatgcccata ggcagaagct 12540ttgccatttg ctatgatgac ttcacctgcc cctggtggcc tggtgatgcc tggtgtctcc 12600cctgcagatg gtgcacaaag ctgtggtgga ggtggacgag tcgggaacca gagcagcggc 12660agccacgggg acaatcttca ctttcaggtc ggcccgcctg aactctcaga ggctagtgtt 12720caacaggccc tttctgatgt tcattgtgga taacaacatc ctcttccttg gcaaagtgaa 12780ccgcccctga ggtggggctt ctcctgaaat ctacaggcct cagggtggga gatgaagggg 12840gctatgctat ggcccatctg tatgctggta gctagtgatt tacacaggtt tagttgacta 12900atgaggcatt acaaataata ttactctatg atgattgctt ccacccacac gactgcaaca 12960tacaggtgcc ttggggaaat gtggagaaca ttcaatcttg ccgtcactat tcatcaatga 13020agattagcac tgagatccag agaggctgga tgacttgctc aagttcacca gcatggtagt 13080ggcaaagaga ggtccagagt cctggccctt gatgcccagc tcagtgccac aaagctcagt 13140aggagggatg ttccagtgga tgagggccac caggaagcac aggtccaagg ctggtcccac 13200acttatcagc agcaacaact gtcagttcat cctgcatggg aaaaatgttg gaatgggagt 13260ctgaaatggg gctactgttt cagtcctaac gtgctgtgtg acattgggac aacactttcc 13320ctctctggac ctcagtttcc ctctgtatac aaggatcaga ttcttgctgt gacccaagaa 13380ctcctgaaat catatagaaa ggctggggtg ggccctgtca ttcgtggttg atttcaatac 13440actcaagtgc cattcatcct ttaagaaaaa catctggata tcaaggtgga aatggcccat 13500ttaatgattg attatatcat tttgtggata tagttataat ctgatgggcc tggctgggag 13560tggaagaagg gaagcctttt gcaaatagta gagtgtcagt tgcaggtgcc aatgactaac 13620tttttgaatt ctatgttggc attaacaata aagcattttg caaacactgg ttataactgt 13680ctttatggag gcagctctgg gaatggtgac attgatagct taccatgctc caggccgggt 13740gcctggccct tcacctggat ggtcgcattt gcccctcata agactcccat gaagaaaggc 13800accactatta tcccatctgt tattcacaga tgggaaaggc aaggcttgaa gtggttaggt 13860ggcttaccca gtcacatatc ttctaagtgg tgcagccaga atttggcggg gggagtgcga 13920ccaagaaccc tacactcagt cctgtgctct gtgctgtgga ggagagatga ccaggagcag 13980aaacttcatt caggggcatc tcaggcacca gctcccccat gagccagcta agttccctcc 14040ctcccttcac caagcaccat gtgtttcctc atgtgccaaa tgaagaggat tagatactca 14100agaatggaat gagtgggtga gtgagtcctt cgctgcaccc aagtctgatt ttctgtgcgc 14160ctgctcaccc caccctgcat gttctaagca tgcttccata aggctgtgcc ccaccctctg 14220attctagagt ctggactgta tcagaggtga gtgcctacta gaggtaacaa ggtcaggacc 14280ccaaaccttg tccatccccc aaagtactga gcccccacca tgcaccagcc catgccagat 14340gctttgcact tgtgatatca cccatccctt gacaacccag caagttctat tattgttccc 14400attttacagg caataacata agtgctttcc cagggtccca cgctggtgac agtgagggcc 14460cagggtctga gagcccagat cgcacatgtg cgggctggtg gcaggggaga tggcagcaac 14520cagactcaga catttctctg cagttgtgct gtgggctcag ggtggctctt tacgaagggg 14580ccccttcgtg gggtcatgca ctcctgtgtg ctttcccttg catcatgcct tgcctgtctt 14640ggcaaatatt tctctggagt ttacccagcc agtccaaggt cacagggaag ccctgtctgt 14700gtctcacaca gaaggtcaac gtccagcact gtccaaactt tactcagcaa acagtcacaa 14760agcagctcct gtgtgggggt cggggtggct cactgtggtc tctgctgcat gtcacacatt 14820gaagcactgt gctggggtca tcgcaggctg tttaactcaa ttgtcacatg agcctgggtg 14880cacaaaatgg tagagcagct cagagagaga tggacagaca gcatgaacct ctgaggagtc 14940aggttttctt ggatgaaggg acactaagat ggctttggag cgtgagaagg acctcaccta 15000gcaaatgtgg gaaaggagtg agacctccag gcagagggac tggctggaga cgagcgtgat 15060gtggtgagcc atggagtgta tgggtcccca cagaacttca gtctgggcct gcacagggca 15120tgtggaggag acaaggagga gggaggtcgg tgccggcggt tcagtgacag agatcctaaa 15180tgggaggcca gtgttttgtc tgatctcttt catcccaatt tcagggtagt ttggtcatcc 15240acgccacatt ccaagtgtcc cctgggccct ttctctccct cacccccctg tctgcacatg 15300agtagatgcc tccacgcagc cctcccagga cgctcacctc tatccacaga tgcttctcca 15360aaacccacca ggccctccca tggaacgagc tcacctacag ggtaaaatca ggtcacggtc 15420acatataggc ctgactactc ccctcaggac cctcattcac agccactgta ttaatttgct 15480ggggctgcca aaacaaagtg tcctcatctg ggaggctgca gtagatttgc tgaaattgat 15540ttgctagcgt tgctgaaatt gattcaagct t 15571 97 4279 DNA Homo sapiens 97cagacaggat attcactgct gtggcaaggc ctgtagagag tttcgaagtt aggaggactc 60aagacggtcc ctccctggac ttttctgaag gggctcaaaa gatgacacgc gccagagctg 120gaaggcgtcg ccaattggtc caacttttcc ctcctccctt tttgcggatg agaaaaactg 180aggcccaggt ttgggatttc cagagcccgg gatttcccgg caacgccgac aaccacattc 240ccccggctat tctgacccgc cccggttccg ggacgctccc tgggagccgc cgccgagggc 300ctgctgggac tcccggggac cccgccgtcg gggcagcccc cacgcccggc gccgcccgcc 360ggaacggcgc cgctgttgcg cacttgcagg ggagccggcg actgagggcg aggcagggag 420ggagcaagcg gggctgggag ggctgctggc gcgggctcgc cggctgtgta tggtctatcg 480caggcagctg acctttgagg aggaaatcgc tgctctccgc tccttcctgt agtaacagcc 540gccgctgccg ccgccgccag gaacccggcc gggagcgaga gccgcggggc gcagagccgg 600cccggctgcc ggacggtgcg gccccaccag gtgaacggcc atggcgggct ggatccaggc 660ccagcagctg cagggagacg cgctgcgcca gatgcaggtg ctgtacggcc agcacttccc 720catcgaggtc cggcactact tggcccagtg gattgagagc cagccatggg atgccattga 780cttggacaat ccccaggaca gagcccaagc cacccagctc ctggagggcc tggtgcagga 840gctgcagaag aaggcggagc accaggtggg ggaagatggg tttttactga agatcaagct 900gaggcactac gccacgcagc tccagaaaac atatgaccgc tgccccctgg agctggtccg 960ctgcatccgg cacattctgt acaatgaaca gaggctggtc cgagaagcca acaattgcag 1020ctctccggct gggatcctgg ttgacgccat gtcccagaag caccttcaga tcaaccagac 1080atttgaggag ctgcgactgg tcacgcagga cacagagaat gagctgaaga aactgcagca 1140gactcaggag tacttcatca tccagtacca ggagagcctg aggatccaag ctcagtttgc 1200ccagctggcc cagctgagcc cccaggagcg tctgagccgg gagacggccc tccagcagaa 1260gcaggtgtct ctggaggcct ggttgcagcg tgaggcacag acactgcagc agtaccgcgt 1320ggagctggcc gagaagcacc agaagaccct gcagctgctg cggaagcagc agaccatcat 1380cctggatgac gagctgatcc agtggaagcg gcggcagcag ctggccggga acggcgggcc 1440ccccgagggc agcctggacg tgctacagtc ctggtgtgag aagttggccg agatcatctg 1500gcagaaccgg cagcagatcc gcagggctga gcacctctgc cagcagctgc ccatccccgg 1560cccagtggag gagatgctgg ccgaggtcaa cgccaccatc acggacatta tctcagccct 1620ggtgaccagc acattcatca ttgagaagca gcctcctcag gtcctgaaga cccagaccaa 1680gtttgcagcc accgtacgcc tgctggtggg cgggaagctg aacgtgcaca tgaatccccc 1740ccaggtgaag gccaccatca tcagtgagca gcaggccaag tctctgctta aaaatgagaa 1800cacccgcaac gagtgcagtg gtgagatcct gaacaactgc tgcgtgatgg agtaccacca 1860agccacgggc accctcagtg cccacttcag gaacatgtca ctgaagagga tcaagcgtgc 1920tgaccggcgg ggtgcagagt ccgtgacaga ggagaagttc acagtcctgt ttgagtctca 1980gttcagtgtt ggcagcaatg agcttgtgtt ccaggtgaag actctgtccc tacctgtggt 2040tgtcatcgtc cacggcagcc aggaccacaa tgccacggct actgtgctgt gggacaatgc 2100ctttgctgag ccgggcaggg tgccatttgc cgtgcctgac aaagtgctgt ggccgcagct 2160gtgtgaggcg ctcaacatga aattcaaggc cgaagtgcag agcaaccggg gcctgaccaa 2220ggagaacctc gtgttcctgg cgcagaaact gttcaacaac agcagcagcc acctggagga 2280ctacagtggc ctgtccgtgt cctggtccca gttcaacagg gagaacttgc cgggctggaa 2340ctacaccttc tggcagtggt ttgacggggt gatggaggtg ttgaagaagc accacaagcc 2400ccactggaat gatggggcca tcctaggttt tgtgaataag caacaggccc acgacctgct 2460catcaacaag cccgacggga ccttcttgtt gcgctttagt gactcagaaa tcgggggcat 2520caccatcgcc tggaagtttg attccccgga acgcaacctg tggaacctga aaccattcac 2580cacgcgggat ttctccatca ggtccctggc tgaccggctg ggggacctga gctatctcat 2640ctatgtgttt cctgaccgcc ccaaggatga ggtcttctcc aagtactaca ctcctgtgct 2700ggctaaagct gttgatggat atgtgaaacc acagatcaag caagtggtcc ctgagtttgt 2760gaatgcatct gcagatgctg ggggcagcag cgccacgtac atggaccagg ccccctcccc 2820agctgtgtgc ccccaggctc cctataacat gtacccacag aaccctgacc atgtactcga 2880tcaggatgga gaattcgacc tggatgagac catggatgtg gccaggcacg tggaggaact 2940cttacgccga ccaatggaca gtcttgactc ccgcctctcg ccccctgccg gtcttttcac 3000ctctgccaga ggctccctct catgaatgtt tgaatcccac gcttctcttt ggaaacaata 3060tgcaatgtga agcggtcgtg ttgtgagttt agtaaggctg tgtacactga cacctttgca 3120ggcatgcatg tgcttgtgtg tgtgtgtgtg tgtccttgcg catgagctac gcctgcctcc 3180cctgtgccag tcctgggatg tggctgcagc agcggtggcc ggcctctttt cagatcatgg 3240catccaagag tgcgccgagt ctgtctctgt catggtagag accgagcctc tgtcactgca 3300ggcactcaat gcagccagac ctattcctcc tgtgcccctc atctgctcag cagctatttg 3360aatgagatga ttcagaaggg gaggggagac aggtaacgtc tgtaagctga agtttcactc 3420cggagtgaga agctttgccc tcctaagaga gagagacaga gagacagaga gagagaaaga 3480gagagtgtgt gggtctatgt aaatgcatct gtcctcatgt gttgatgtaa ccgattcatc 3540tctcagaagg gaggctgggg ttcattttcg agtagtattt tatactttag tgaacgtgga 3600ctccagactc tctgtgaacc ctatgagagc gcgtctgggc ccggccatgt ccttagcaca 3660ggggggccgc cggtttgagt gagggtttct gagctgctct gaattagtcc ttgcttggct 3720gcttggcctt gggttcattc aagctcacga tgctgttccc acgtttcccg ggatatatat 3780tctctcccct ccgttgggcc ccagccttct ttgcttgcct ctctgtttgt aaccttgtcg 3840acaaagaggt agaaaagatt gggtctagga tatggtgggt ggacaggggc cccgggactt 3900ggagggttgg tcctcttgcc tcctggaaaa aacaaaaaca aaaaactgca gtgaaagaca 3960agctgcaaat cagccatgtg ctgcgtgcct gtggaatctg gagtgagggg taaaagctga 4020tctggtttga ctccgctgga ggtggggcct ggagcaggcc ttgcgctgtt gcgtaactgg 4080ctgtgttctg gtgaggcctt gctcccaacc ccacacgctc ctccctctga ggcgtgagga 4140ctcgcagtca ggggcagctg accatggaag attgagagcc caaggtttaa acttcttctc 4200tgaagggagg tggggatgag aagaggggtt tttttgtact ttgtacaaag accacacatt 4260tgtgtaaaca gtgttttgg 4279 98 3799 DNA Homo sapiens 98 ctggcactgggtggtaacca gcaagccagc tggcatccgc atccagggtt tgtttcaatg 60 atgtctcgtggagaatatgg aggggctggt gccaggactg tccttggctt tgcctcgggg 120 tgtgaacggggtcagtgacc tctaaaacta acctgcctct cagttctgaa tccagacaga 180 atcaatcctcagctgtgtct cgctccacac cccctgccct ggaagccagg gaaggttgga 240 ggtgctagggggtcaggctc ccctctgtga cccctgcagc tgttgtggtg actcatgtcc 300 caacctagctgcctctccca aggagacttt cccctgggac aagggggagg gaatggcatg 360 gaggaggcccacatcaagcg gggccaggaa cccacggtgg caggagctgg gctggtgacc 420 tacccagggcagaagggccc gggactcatc cagaggggaa ggaaggggtc ttcaggaaga 480 ccacggagatgccacaggca gaattggctt cccatctggg agataggtgg ggagaccctg 540 gcattttgacagccagaacc tggggtgctg agcagaatct tcatgcctgg cctggccgcc 600 ttcggagggaagctggaggg ttgggtgcga gaggagtggg gtcagagccc ctacatccgc 660 aggaccccaaatcggctggg ccccaaggcc cggactgcgc tccccggtgg ccccggcggc 720 cctccgcgaatgcgtcctgc ccctcccctg cccaagccct ctgccctcac ccgggtccgg 780 cgccgcccccgaagtggcgg gaacaacccg aacccgaacc ttctgtcctc gggagccccc 840 agataagcggctgggaaccc gcggggcccg caggggaggc ccggctgttc cgcccgctaa 900 gtgcattagcacagctcacc tcccctatcg cgcctgccat cggacgggca gtgccgcgcc 960 ctgctctggggcccccggag cgaccacagc ggaggccgga acggactgtc ctttctgggg 1020 cggggtggggagggggtgtc gctggagggc ccggtggcat agcaacggac gagagaggcc 1080 tggaggaggggcggggaggg ggagttgtgt ggcagttcta agggaagggt gggtgctggg 1140 acgggtgtccgggagggagg ggagcctggc ggggtctggg gcctcgtcgc ggagggcgct 1200 gcgagggggaaactggggaa agggcctaat tccccagtct ccacctcgaa tcaggaaaga 1260 gaaggggcgggctgctgggc aaaagaggtg aatggctgcg gggggctgga gaagagagat 1320 gggaggggccggccggcggg ggtgaggggg tctaaagatt gtgggggtga ggaactgagg 1380 gtggggggcgcccagaggcg ggactcgggg cggggcaggc gaggcggagg gcgagggctg 1440 cgggagcaagtacggagccg ggggtgtggg ggacgattgc cgctgcagcc gccgccccac 1500 tcacctccggtgtgtctgca gcccggacac taagggagat ggatgaatgg gtggggagga 1560 tgcggcgcacatggccccgg gcggctcggc ggtcagctgc cgcccccaca gcggaccggt 1620 cggggcgggggtcgggcggt agaaaaaagg gccgcgaggc gagcggggca ctgggcggac 1680 cgcggcggcagcatgagcgg cgcagaccgt agccccaatg cgggcgcagc ccctgactcg 1740 gccccgggccaggcggcggt ggcttcggcc taccagcgct tcgagccgcg cgcctacctc 1800 cgcaacaactacgcgccccc tcgcggggac ctgtgcaacc cgaacggcgt cgggccgtgg 1860 aagctgcgctgcttggcgca gaccttcgcc accggtgagc gggggaaact gaggcacgag 1920 ggacaagaggtcgtcgggga gtgaaagcag gcgcagggaa ataaaaagaa ggaaagggag 1980 acagaccaggcgcctaacag atggggacca agaaacaaga gatagctgag aggtgcaaac 2040 agaagagaaaaaggagcaac atcccttagg agaggggcag aggagagaga ggtggagaga 2100 gggggcggagagtgctcaga attgagagct aaggtggggg atgcaggaca gactgaggtg 2160 gagatgcataggaggaaatg gaggcagatg tgggacaggg gtgagaaact ccaggatttc 2220 ctcgctgagcctggctggta ggtatagttg ttttctttct ttttctttat tttattttca 2280 tttatttacttatttttatt ttttatttgt tttgagacgg agtttcgctc ttgttgccca 2340 ggctggagtacaatggcgcc atctcggctc actgcaacct ccgcctcccc gggttcaagc 2400 gattctcttgcctcagcttc cctagtagct gggattacag gcatgcgccc ccatgcctgg 2460 ctaatttatttgtattttta gtagagacgg gacttctcca tgttggtcag gctggtctcg 2520 aactcccaaccttaggatcc acccaccccg gcctcccaaa gtgctgggat tacaggtgtg 2580 agccactgcgcccggccagt aggtatagtc ttctagatgt gaaacctgag tctcagagcg 2640 gtgaagttcccttccgaagg gcagcccatg ttggagctgg gttcagtcta actctggggc 2700 caatgctttttccagatgga gacacatttg cagaggagaa ggaagaacta gagagaggca 2760 gggagatgcaggggagggaa gggtaaggag gcaggggctg cctgggctgg ctggcaccag 2820 gaccctcttcctctgccctg cccaggtgaa gtgtccggac gcaccctcat cgacattggt 2880 tcaggccccaccgtgtacca gctgctcagt gcctgcagcc actttgagga catcaccatg 2940 acagatttcctggaggtcaa ccgccaggag ctggggcgct ggctgcagga ggagccgggg 3000 gccttcaactggagcatgta cagccaacat gcctgcctca ttgagggcaa ggggtaagga 3060 ctggggggtgagggttgggg aggaggcttc ccatagagtg gctggttggg gcaacagagg 3120 cctgagcgtagaacagcctt gagccctgcc ttgtgcctcc tgcacaggga atgctggcag 3180 gataaggagcgccagctgcg agccagggtg aaacgggtcc tgcccatcga cgtgcaccag 3240 ccccagcccctgggtgctgg gagcccagct cccctgcctg ctgacgccct ggtctctgcc 3300 ttctgcttggaggctgtgag cccagatctt gccagctttc agcgggccct ggaccacatc 3360 accacgctgctgaggcctgg ggggcacctc ctcctcatcg gggccctgga ggagtcgtgg 3420 tacctggctggggaggccag gctgacggtg gtgccagtgt ctgaggagga ggtgagggag 3480 gccctggtgcgtagtggcta caaggtccgg gacctccgca cctatatcat gcctgcccac 3540 cttcagacaggcgtagatga tgtcaagggc gtcttcttcg cctgggctca gaaggttggg 3600 ctgtgagggctgtacctggt gccctgtggc ccccacccac ctggattccc tgttctttga 3660 agtggcacctaataaagaaa taataccctg ccgctgcggt cagtgctgtg tgtggctctc 3720 ctgggaagcagcaagggccc agagatctga gtgtccgggt aggggagaca ttcaccctag 3780 gctttttttccagaagctt 3799 99 1550 DNA Homo sapiens 99 tgccgccgtc ccgcccgccagcgccccagc gaggaagcag cgcgcagccc gcggcccagc 60 gcacccgcag cagcgcccgcagctcgtccg cgccatgttc caggcggccg agcgccccca 120 ggagtgggcc atggagggcccccgcgacgg gctgaagaag gagcggctac tggacgaccg 180 ccacgacagc ggcctggactccatgaaaga cgaggagtac gagcagatgg tcaaggagct 240 gcaggagatc cgcctcgagccgcaggaggt gccgcgcggc tcggagccct ggaagcagca 300 gctcaccgag gacggggactcgttcctgca cttggccatc atccatgaag aaaaggcact 360 gaccatggaa gtgatccgccaggtgaaggg agacctggct ttcctcaact tccagaacaa 420 cctgcagcag actccactccacttggctgt gatcaccaac cagccagaaa ttgctgaggc 480 acttctggga gctggctgtgatcctgagct ccgagacttt cgaggaaata cccccctaca 540 ccttgcctgt gagcagggctgcctggccag cgtgggagtc ctgactcagt cctgcaccac 600 cccgcacctc cactccatcctgaaggctac caactacaat ggccacacgt gtctacactt 660 agcctctatc catggctacctgggcatcgt ggagcttttg gtgtccttgg gtgctgatgt 720 caatgctcag gagccctgtaatggccggac tgcccttcac ctcgcagtgg acctgcaaaa 780 tcctgacctg gtgtcactcctgttgaagtg tggggctgat gtcaacagag ttacctacca 840 gggctattct ccctaccagctcacctgggg ccgcccaagc acccggatac agcagcagct 900 gggccagctg acactagaaaaccttcagat gctgccagag agtgaggatg aggagagcta 960 tgacacagag tcagagttcacggagttcac agaggacgag ctgccctatg atgactgtgt 1020 gtttggaggc cagcgtctgacgttatgagt gcaaaggggc tgaaagaaca tggacttgta 1080 tatttgtaca aaaaaaaagttttatttttc taaaaaaaga aaaaagaaga aaaaatttaa 1140 agggtgtact tatatccacactgcacactg cctagcccaa aacgtcttat tgtggtagga 1200 tcagccctca ttttgttgcttttgtgaact ttttgtaggg gacgagaaag atcattgaaa 1260 ttctgagaaa acttcttttaaacctcacct ttgtggggtt tttggagaag gttatcaaaa 1320 atttcatgga aggaccacattttatattta ttgtgcttcg agtgactgac cccagtggta 1380 tcctgtgaca tgtaacagccaggagtgtta agcgttcagt gatgtggggt gaaaagttac 1440 tacctgtcaa ggtttgtgttaccctcctgt aaatggtgta cataatgtat tgttggtaat 1500 tattttggta cttttatgatgtatatttat taaagagatt tttacaaatg 1550 100 4673 DNA Homo sapiens 100tttgctcctg ctcctccgct cctcctgcgc ggggtgctga aacagcccgg ggaagtagag 60ccgcctccgg ggagcccaac cagccgaacg ccgccggcgt cagcagcctt gcgcggccac 120agcatgaccg ctcgcggcct ggcccttggc ctcctcctgc tgctactgtg tccagcgcag 180gtgttttcac agtcctgtgt ttggtatgga gagtgtggaa ttgcatatgg ggacaagagg 240tacaattgcg aatattctgg cccaccaaaa ccattgccaa aggatggata tgacttagtg 300caggaactct gtccaggatt cttctttggc aatgtcagtc tctgttgtga tgttcggcag 360cttcagacac taaaagacaa cctgcagctg cctctacagt ttctgtccag atgtccatcc 420tgtttttata acctactgaa cctgttttgt gagctgacat gtagccctcg acagagtcag 480tttttgaatg ttacagctac tgaagattat gttgatcctg ttacaaacca gacgaaaaca 540aatgtgaaag agttacaata ctacgtcgga cagagttttg ccaatgcaat gtacaatgcc 600tgccgggatg tggaggcccc ctcaagtaat gacaaggccc tgggactcct gtgtgggaag 660gacgctgacg cctgtaatgc caccaactgg attgaataca tgttcaataa ggacaatgga 720caggcacctt ttaccatcac tcctgtgttt tcagattttc cagtccatgg gatggagccc 780atgaacaatg ccaccaaagg ctgtgacgag tctgtggatg aggtcacagc accatgtagc 840tgccaagact gctctattgt ctgtggcccc aagccccagc ccccacctcc tcctgctccc 900tggacgatcc ttggcttgga cgccatgtat gtcatcatgt ggatcaccta catggcgttt 960ttgcttgtgt tttttggagc attttttgca gtgtggtgct acagaaaacg gtattttgtc 1020tccgagtaca ctcccatcga tagcaatata gctttttctg ttaatgcaag tgacaaagga 1080gaggcgtcct gctgtgaccc tgtcagcgca gcatttgagg gctgcttgag gcggctgttc 1140acacgctggg ggtctttctg cgtccgaaac cctggctgtg tcattttctt ctcgctggtc 1200ttcattactg cgtgttcgtc aggcctggtg tttgtccggg tcacaaccaa tccagttgac 1260ctctggtcag cccccagcag ccaggctcgc ctggaaaaag agtactttga ccagcacttt 1320gggcctttct tccggacgga gcagctcatc atccgggccc ctctcactga caaacacatt 1380taccagccat acccttcggg agctgatgta ccctttggac ctccgcttga catacagata 1440ctgcaccagg ttcttgactt acaaatagcc atcgaaaaca ttactgcctc ttatgacaat 1500gagactgtga cacttcaaga catctgcttg gcccctcttt caccgtataa cacgaactgc 1560accattttga gtgtgttaaa ttacttccag aacagccatt ccgtgctgga ccacaagaaa 1620ggggacgact tctttgtgta tgccgattac cacacgcact ttctgtactg cgtacgggct 1680cctgcctctc tgaatgatac aagtttgctc catgaccctt gtctgggtac gtttggtgga 1740ccagtgttcc cgtggcttgt gttgggaggc tatgatgatc aaaactacaa taacgccact 1800gcccttgtga ttaccttccc tgtcaataat tactataatg atacagagaa gctccagagg 1860gcccaggcct gggaaaaaga gtttattaat tttgtgaaaa actacaagaa tcccaatctg 1920accatttcct tcactgctga acgaagtatt gaagatgaac taaatcgtga aagtgacagt 1980gatgtcttca ccgttgtaat tagctatgcc atcatgtttc tatatatttc cctagccttg 2040gggcacatca aaagctgtcg caggcttctg gtggattcga aggtctcact aggcatcgcg 2100ggcatcttga tcgtgctgag ctcggtggct tgctccttgg gtgtcttcag ctacattggg 2160ttgcccttga ccctcattgt gattgaagtc atcccgttcc tggtgctggc tgttggagtg 2220gacaacatct tcattctggt gcaggcctac cagagagatg aacgtcttca aggggaaacc 2280ctggatcagc agctgggcag ggtcctagga gaagtggctc ccagtatgtt cctgtcatcc 2340ttttctgaga ctgtagcatt tttcttagga gcattgtccg tgatgccagc cgtgcacacc 2400ttctctctct ttgcgggatt ggcagtcttc attgactttc ttctgcagat tacctgtttc 2460gtgagtctct tggggttaga cattaaacgt caagagaaaa atcggctaga catcttttgc 2520tgtgtcagag gtgctgaaga tggaacaagc gtccaggcct cagagagctg tttgtttcgc 2580ttcttcaaaa actcctattc tccacttctg ctaaaggact ggatgagacc aattgtgata 2640gcaatatttg tgggtgttct gtcattcagc atcgcagtcc tgaacaaagt agatattgga 2700ttggatcagt ctctttcgat gccagatgac tcctacatgg tggattattt caaatccatc 2760agtcagtacc tgcatgcggg tccgcctgtg tactttgtcc tggaggaagg gcacgactac 2820acttcttcca aggggcagaa catggtgtgc ggcggcatgg gctgcaacaa tgattccctg 2880gtgcagcaga tatttaacgc ggcgcagctg gacaactata cccgaatagg cttcgccccc 2940tcgtcctgga tcgacgatta tttcgactgg gtgaagccac agtcgtcttg ctgtcgagtg 3000gacaatatca ctgaccagtt ctgcaatgct tcagtggttg accctgcctg cgttcgctgc 3060aggcctctga ctccggaagg caaacagagg cctcaggggg gagacttcat gagattcctg 3120cccatgttcc tttcggataa ccctaacccc aagtgtggca aagggggaca tgctgcctat 3180agttctgcag ttaacatcct ccttggccat ggcaccaggg tcggagccac gtacttcatg 3240acctaccaca ccgtgctgca gacctctgct gactttattg acgctctgaa gaaagcccga 3300cttatagcca gtaatgtcac cgaaaccatg ggcattaacg gcagtgccta ccgagtattt 3360ccttacagtg tgttttatgt cttctacgaa cagtacctga ccatcattga cgacactatc 3420ttcaacctcg gtgtgtccct gggcgcgata tttctggtga ccatggtcct cctgggctgt 3480gagctctggt ctgcagtcat catgtgtgcc accatcgcca tggtcttggt caacatgttt 3540ggagttatgt ggctctgggg catcagtctg aacgctgtat ccttggtcaa cctggtgatg 3600agctgtggca tctccgtgga gttctgcagc cacataacca gagcgttcac ggtgagcatg 3660aaaggcagcc gcgtggagcg cgcggaagag gcacttgccc acatgggcag ctccgtgttc 3720agtggaatca cacttacaaa atttggaggg attgtggtgt tggcttttgc caaatctcaa 3780attttccaga tattctactt caggatgtat ttggccatgg tcttactggg agccactcac 3840ggattaatat ttctccctgt cttactcagt tacatagggc catcagtaaa taaagccaaa 3900agttgtgcca ctgaagagcg atacaaagga acagagcgcg aacggcttct aaatttctag 3960ccctctcgca gggcatcctg actgaactgt gtctaagggt cggtcggttt accactggac 4020gggtgctgca tcggcaaggc caagttgaac accggatggt gccaaccatc ggttgtttgg 4080cagcagcttt gaacgtagcg cctgtgaact caggaatgca cagttgactt gggaagcagt 4140attactagat ctggaggcaa ccacaggaca ctaaacttct cccagcctct tcaggaaaga 4200aacctcattc tttggcaagc aggaggtgac actagatggc tgtgaatgtg atccgctcac 4260tgacactctg taaaggccaa tcaatgcact gtctgtcctc tcctttttag gagtaagcca 4320tcccacaagt tctataccat atttttagtg acagttgagg ttgtagatac actttataac 4380attttatagt ttaaagagct ttattaatgc aataaattaa ctttgtacac atttttatat 4440aaaaaaacag caagtgattt cagaatgttg taggcctcat tagagcttgg tctccaaaaa 4500tctgtttgaa aaaagcaaca tgttcttcac agtgttcccc tagaaaggaa gagatttaat 4560tgccagttag atgtggcatg aaatgaggga caaagaaagc atctcgtagg tgtgtctact 4620gggttttaac ttatttttct ttaataaaat acattgtttt cctaaaaaaa aaa 4673 101 1362DNA Homo sapiens 101 catttgggga cgctctcagc tctcggcgca cggcccagcttccttcaaaa tgtctactgt 60 tcacgaaatc ctgtgcaagc tcagcttgga gggtgatcactctacacccc caagtgcata 120 tgggtctgtc aaagcctata ctaactttga tgctgagcgggatgctttga acattgaaac 180 agccatcaag accaaaggtg tggatgaggt caccattgtcaacattttga ccaaccgcag 240 caatgcacag agacaggata ttgccttcgc ctaccagagaaggaccaaaa aggaacttgc 300 atcagcactg aagtcagcct tatctggcca cctggagacggtgattttgg gcctattgaa 360 gacacctgct cagtatgacg cttctgagct aaaagcttccatgaaggggc tgggaaccga 420 cgaggactct ctcattgaga tcatctgctc cagaaccaaccaggagctgc aggaaattaa 480 cagagtctac aaggaaatgt acaagactga tctggagaaggacattattt cggacacatc 540 tggtgacttc cgcaagctga tggttgccct ggcaaagggtagaagagcag aggatggctc 600 tgtcattgat tatgaactga ttgaccaaga tgctcgggatctctatgacg ctggagtgaa 660 gaggaaagga actgatgttc ccaagtggat cagcatcatgaccgagcgga gcgtgcccca 720 cctccagaaa gtatttgata ggtacaagag ttacagcccttatgacatgt tggaaagcat 780 caggaaagag gttaaaggag acctggaaaa tgctttcctgaacctggttc agtgcattca 840 gaacaagccc ctgtattttg ctgatcggct gtatgactccatgaagggca aggggacgcg 900 agataaggtc ctgatcagaa tcatggtctc ccgcagtgaagtggacatgt tgaaaattag 960 gtctgaattc aagagaaagt acggcaagtc cctgtactattatatccagc aagacactaa 1020 gggcgactac cagaaagcgc tgctgtacct gtgtggtggagatgactgaa gcccgacacg 1080 gcctgagcgt ccagaaatgg tgctcaccat gcttccagctaacaggtcta gaaaaccagc 1140 ttgcgaataa cagtccccgt ggccatccct gtgagggtgacgttagcatt acccccaacc 1200 tcattttagt tgcctaagca ttgcctggcc ttcctgtctagtctctcctg taagccaaag 1260 aaatgaacat tccaaggagt tggaagtgaa gtctatgatgtgaaacactt tgcctcctgt 1320 gtactgtgtc ataaacagat gaataaactg aatttgtacttt 1362 102 2591 DNA Homo sapiens 102 cccggacgtg cggctcccct cggcctcctcgccatggacg cggacgactc ccgggccccc 60 aagggctcct tgcggaagtt cctggagcacctctccgggg ccggcaaggc catcggcgtg 120 ctgaccagcg gcggggatgc tcaaggtatgaacgctgccg tccgtgccgt ggtgcgcatg 180 ggtatctacg tgggggccaa ggtgtacttcatctacgagg gctaccaggg catggtggac 240 ggaggctcaa acatcgcaga ggccgactgggagagtgtct ccagcatcct gcaagtgggc 300 gggacgatca ttggcagtgc gcggtgccaggccttccgca cgcgggaagg ccgcctgaag 360 gctgcttgca acctgctgca gcgcggcatcaccaacctgt gtgtgatcgg cggggacggg 420 agcctcaccg gggccaacct cttccggaaggagtggagtg ggctgctgga ggagctggcc 480 aggaacggcc agatcgataa ggaggccgtgcagaagtacg cctacctcaa cgtggtgggc 540 atggtgggct ccatcgacaa tgatttctgcggcaccgaca tgaccatcgg cacggactcc 600 gccctgcaca ggatcatcga ggtcgtcgacgccatcatga ccacggccca gagccaccag 660 aggaccttcg ttctggaggt gatgggacgacactgtgggt acctggccct ggtgagtgcc 720 ttggcctgcg gtgcggactg ggtgttccttccagaatctc caccagagga aggctgggag 780 gagcagatgt gtgtcaaact ctcggagaaccgtgcccgga aaaaaaggct gaatattatt 840 attgtggctg aaggagcaat tgatacccaaaataaaccca tcacctctga gaaaatcaaa 900 gagcttgtcg tcacgcagct gggctatgacacacgtgtga ccatcctcgg gcacgtgcag 960 agaggaggga ccccttcggc attcgacaggatcttggcca gccgcatggg agtggaggca 1020 gtcatcgcct tgctagaggc caccccggacaccccagctt gcgtcgtgtc actgaacggg 1080 aaccacgccg tgcgcctgcc gctgatggagtgcgtgcaga tgactcagga tgtgcagaag 1140 gcgatggacg agaggagatt tcaagatgcggttcgactcc gagggaggag ctttgcgggc 1200 aacctgaaca cctacaagcg acttgccatcaagctgccgg atgatcagat cccaaagacc 1260 aattgcaacg tagctgtcat caacgtgggggcacccgcgg ctgggatgaa cgcggccgta 1320 cgctcagctg tgcgcgtggg cattgccgacggccacagga tgctcgccat ctatgatggc 1380 tttgacggct tcgccaaggg ccagatcaaagaaatcggct ggacagatgt cgggggctgg 1440 accggccaag gaggctccat tcttgggacaaaacgcgttc tcccggggaa gtacttggaa 1500 gagatcgcca cacagatgcg cacgcacagcatcaacgcgc tgctgatcat cggtggattc 1560 gaggcctacc tgggactcct ggagctgtcagccgcccggg agaagcacga ggagttctgt 1620 gtccccatgg tcatggttcc cgctactgtgtccaacaatg tgccgggttc cgatttcagc 1680 atcggggcag acaccgccct gaacactatcaccgacacct gcgaccgcat caagcagtcc 1740 gccagcggaa ccaagcggcg cgtgttcatcatcgagacca tgggcggcta ctgtggctac 1800 ctggccaaca tgggggggct cgcggccggagctgatgccg catacatttt cgaagagccc 1860 ttcgacatca gggatctgca gtccaacgtggagcacctga cggagaaaat gaagaccacc 1920 atccagagag gccttgtgct cagaaatgagagctgcagtg aaaactacac caccgacttc 1980 atttaccagc tgtattcaga agagggcaaaggcgtgtttg actgcaggaa gaacgtgctg 2040 ggtcacatgc agcagggtgg ggcaccctctccatttgata gaaactttgg aaccaaaatc 2100 tctgccagag ctatggagtg gatcactgcaaaactcaagg aggcccgggg cagaggaaaa 2160 aaatttacca ccgatgattc catttgtgtgctgggaataa gcaaaagaaa cgttattttt 2220 caacctgtgg cagagctgaa gaagcaaacggattttgagc acaggattcc caaagaacag 2280 tggtggctca agctacggcc cctcatgaaaatcctggcca agtacaaggc cagctatgac 2340 gtgtcggact caggccagct ggaacatgtgcagccctgga gtgtctgacc cagtcccgcc 2400 tgcatgtgcc tgcagccacc gtggactgtctgtttttgta acacttaagt tattttatca 2460 gcactttatg cacgtattat tgacattaatacctaatcgg cgagtgccca tctgccccac 2520 cagctccagt gcgtgctgtc tgtggagtgtgtctcatgct ttcagatgtg catatgagca 2580 gaattaatta a 2591 103 865 DNA Homosapiens 103 gaattccgga gttccgggcg cgcgcgacgt cagtttgagt tctgtgttctccccgcccgt 60 gtcccgcccg acccgcgccc gcgatgctgg cgctgcgctg cggctcccgctggctcggcc 120 tgctctccgt cccgcgctcc gtgccgctgc gcctccccgc ggcccgcgcctgcagcaagg 180 gctccggcga cccgtcctct tcctcctcct ccgggaaccc gctcgtgtacctggacgtgg 240 acgccaacgg gaagccgctc ggccgcgtgg tgctggagct gaaggcagatgtcgtcccaa 300 agacagctga gaacttcaga gccctgtgca ctggtgagaa gggcttcggctacaaaggct 360 ccaccttcca cagggtgatc ccttccttca tgtgccaggc gggcgacttcaccaaccaca 420 atggcacagg cgggaagtcc atctacggaa gccgctttcc tgacgagaactttacactga 480 agcacgtggg gccaggtgtc ctgtccatgg ctaatgctgg tcctaacaccaacggctccc 540 agttcttcat ctgcaccata aagacagact ggttggatgg caagcatgttgtgttcggtc 600 acgtcaaaga gggcatggac gtcgtgaaga aaatagaatc tttcggctctaagagtggga 660 ggacatccaa gaagattgtc atcacagact gtggccagtt gagctaatctgtggccaggg 720 tgctggcatg gtggcagctg caaatgtcca tgcacccagg tggccgcgttgggctgtcag 780 ccaaggtgcc tgaaacgata cgtgtgccca ctccactgtc acagtgtgcctgaggaaggc 840 tgctagggat gttagacgga attcc 865 104 661 DNA Homo sapiens104 tcaaactgaa gctcgcactc tcgcctccag catgaaagtc tctgccgccc ttctgtgcct 60gctgctcata gcagccacct tcattcccca agggctcgct cagccagatg caatcaatgc 120cccagtcacc tgctgctata acttcaccaa taggaagatc tcagtgcaga ggctcgcgag 180ctatagaaga atcaccagca gcaagtgtcc caaagaagct gtgatcttca agaccattgt 240ggccaaggag atctgtgctg accccaagca gaagtgggtt caggattcca tggaccacct 300ggacaagcaa acccaaactc cgaagacttg aacactcact ccacaaccca agaatctgca 360gctaacttat tttcccctag ctttccccag acatcctgtt ttattttatt ataatgaatt 420ttgtttgttg atgtgaaaca ttatgcctta agtaatgtta attcttattt aagttattga 480tgttttaagt ttatctttca tggtactagt gttttttaga tacagagact tggggaaatt 540gcttttcctc ttgaaccaca gttctacccc tgggatgttt tgagggtctt tgcaagaatc 600atttttttaa cattccaatg catttaatac aaagaattgc taaaatatta ttgtggaaat 660 g661 105 420 DNA Homo sapiens 105 gggggctggc cgagcgccgt gcgcgcttgggagaaggccg gaagcttacc agccgagaag 60 gaattcctag ctagcttcag agccggtgcctccggagcca gcgtggtggc catagacaac 120 aagttcgaac aggccatgga tctggtgaagaatcatctga tgtatgctgt gagagaggag 180 gtggagatcc tgaaggagca gatccgagagctggtggaga agaactccca gctagagcgt 240 gagaacaccc tgttgaagac cctggcaagcccagagcagc tggagaagtt ccagtcctgt 300 ctgagccctg aagagccagc tcccgaatccccacaagtgc ccgaggcccc tggtggttct 360 gcggtgtaag tcgctctgtc ctcagggtgggcagagccac taaacttgtt ttacctaggg 420 106 926 DNA Homo sapiens 106gaatctcttt ctctcccttc agaatcttat cttggctttg gatcttagaa gagaatcact 60aaccagagac gagactcagt gagtgagcag gtgttttgga caatggactg gttgagccca 120tccctattat aaaaatgtct cagagcaacc gggagctggt ggttgacttt ctctcctaca 180agctttccca gaaaggatac agctggagtc agtttagtga tgtggaagag aacaggactg 240aggccccaga agggactgaa tcggagatgg agacccccag tgccatcaat ggcaacccat 300cctggcacct ggcagacagc cccgcggtga atggagccac tgcgcacagc agcagtttgg 360atgcccggga ggtgatcccc atggcagcag taaagcaagc gctgagggag gcaggcgacg 420agtttgaact gcggtaccgg cgggcattca gtgacctgac atcccagctc cacatcaccc 480cagggacagc atatcagagc tttgaacagg tagtgaatga actcttccgg gatggggtaa 540actggggtcg cattgtggcc tttttctcct tcggcggggc actgtgcgtg gaaagcgtag 600acaaggagat gcaggtattg gtgagtcgga tcgcagcttg gatggccact tacctgaatg 660accacctaga gccttggatc caggagaacg gcggctggga tacttttgtg gaactctatg 720ggaacaatgc agcagccgag agccgaaagg gccaggaacg cttcaaccgc tggttcctga 780cgggcatgac tgtggccggc gtggttctgc tgggctcact cttcagtcgg aaatgaccag 840acactgacca tccactctac cctcccaccc ccttctctgc tccaccacat cctccgtcca 900gccgccattg ccaccaggag aacccg 926 107 1293 DNA Homo sapiens 107cacgtcagcc ggggctagaa aaggcggcgg ggctgggccc agcgaggtga cagcctcgct 60tggacgcaga gcccggcccg acgccgccat gacggccgcg ctcttcagcc tggacggccc 120ggccggcggc gcgccctggc ctgcggagcc tgcgcccttc tacgaaccgg gccgggcggg 180caagccgggc cgcggggccg agccaggggc cctaggcgag ccaggcgccg ccgcccccgc 240catgtacgac gacgagagcg ccatcgactt cagcgcctac atcgactcca tggccgccgt 300gcccaccctg gagctgtgcc acgacgagct cttcgccgac ctcttcaaca gcaatcacaa 360ggcgggcggc gcggggcccc tggagcttct tcccggcggc cccgcgcgcc ccttgggccc 420gggccctgcc gctccccgcc tgctcaagcg cgagcccgac tggggcgacg gcgacgcgcc 480cggctcgctg ttgcccgcgc aggtgggccc gtgcgcacag accgtggtga gcttggcggc 540cgcagggcag cccaccccgc ccacgtcgcc ggagccgccg cgcagcagcc ccaggcagac 600ccccgcgccc ggccccgccc gggagaagag cgccggcaag aggggcccgg accgcggcag 660ccccgagtac cggcagcggc gcgagcgcaa caacatcgcc gtgcgcaaga gccgcgacaa 720ggccaagcgg cgcaaccagg agatgcagca gaagttggtg gagctgtcgg ctgagaacga 780gaagctgcac cagcgcgtgg agcagctcac gcgggacctg gccggcctcc ggcagttctt 840caagcagctg cccagcccgc ccttcctgcc ggccgccggg acagcagact gccggtaacg 900cgcggccggg gcgggagaga ctcagcaacg acccatacct cagacccgac ggcccggagc 960ggacgccctg ctgccgacgc cagagccgcc gcgtgcccgc tgcagtttct tggacataga 1020ccaaagaagc tacagcctgg acttaccacc actaaactgc gagagaagct aaacgtgttt 1080attttccctt aaattatttt tgtaatggta gctttttcta catcttactc ctgttgatgc 1140agctaaggta catttgtaaa aagaaaaaaa accagacttt tcagacaaac cctttgtatt 1200gtagataaga ggaaaagact gagcatgctc acttttttat attaattttt aggacagtat 1260ttgtaagaat aaagcagcat ttgaaatgcc cct 1293 108 2529 DNA Homo sapiens 108ccagcaaaac ctgtttagac acatggacaa gaatcccagc gctacaaggc acacagtccg 60cttcttcgtc ctcagggttg ccagcgcttc ctggaagtcc tgaagctctc gcagtgcagt 120gagttcatgc accttcttgc caagcctcag tctttgggat ctggggaggc cgcctggttt 180tcctccctcc ttctgcacgt ctgctggggt ctcttcctct ccaggccttg ccgtccccct 240ggcctctctt cccagctcac acatgaagat gcacttgcaa agggctctgg tggtcctggc 300cctgctgaac tttgccacgg tcagcctctc tctgtccact tgcaccacct tggacttcgg 360ccacatcaag aagaagaggg tggaagccat taggggacag atcttgagca agctcaggct 420caccagcccc cctgagccaa cggtgatgac ccacgtcccc tatcaggtcc tggcccttta 480caacagcacc cgggagctgc tggaggagat gcatggggag agggaggaag gctgcaccca 540ggaaaacacc gagtcggaat actatgccaa agaaatccat aaattcgaca tgatccaggg 600gctggcggag cacaacgaac tggctgtctg ccctaaagga attacctcca aggttttccg 660cttcaatgtg tcctcagtgg agaaaaatag aaccaaccta ttccgagcag aattccgggt 720cttgcgggtg cccaacccca gctctaagcg gaatgagcag aggatcgagc tcttccagat 780ccttcggcca gatgagcaca ttgccaaaca gcgctatatc ggtggcaaga atctgcccac 840acggggcact gccgagtggc tgtcctttga tgtcactgac actgtgcgtg agtggctgtt 900gagaagagag tccaacttag gtctagaaat cagcattcac tgtccatgtc acacctttca 960gcccaatgga gatatcctgg aaaacattca cgaggtgatg gaaatcaaat tcaaaggcgt 1020ggacaatgag gatgaccatg gccgtggaga tctggggcgc ctcaagaagc agaaggatca 1080ccacaaccct catctaatcc tcatgatgat tcccccacac cggctcgaca acccgggcca 1140ggggggtcag aggaagaagc gggctttgga caccaattac tgcttccgca acttggagga 1200gaactgctgt gtgcgccccc tctacattga cttccgacag gatctgggct ggaagtgggt 1260ccatgaacct aagggctact atgccaactt ctgctcaggc ccttgcccat acctccgcag 1320tgcagacaca acccacagca cggtgctggg actgtacaac actctgaacc ctgaagcatc 1380tgcctcgcct tgctgcgtgc cccaggacct ggagcccctg accatcctgt actatgttgg 1440gaggaccccc aaagtggagc agctctccaa catggtggtg aagtcttgta aatgtagctg 1500agaccccacg tgcgacagag agaggggaga gagaaccacc actgcctgac tgcccgctcc 1560tcgggaaaca cacaagcaac aaacctcact gagaggcctg gagcccacaa ccttcggctc 1620cgggcaaatg gctgagatgg aggtttcctt ttggaacatt tctttcttgc tggctctgag 1680aatcacggtg gtaaagaaag tgtgggtttg gttagaggaa ggctgaactc ttcagaacac 1740acagactttc tgtgacgcag acagagggga tggggataga ggaaagggat ggtaagttga 1800gatgttgtgt ggcaatggga tttgggctac cctaaaggga gaaggaaggg cagagaatgg 1860ctgggtcagg gccagactgg aagacacttc agatctgagg ttggatttgc tcattgctgt 1920accacatctg ctctagggaa tctggattat gttatacaag gcaagcattt tttttttttt 1980ttaaagacag gttacgaaga caaagtccca gaattgtatc tcatactgtc tgggattaag 2040ggcaaatcta ttacttttgc aaactgtcct ctacatcaat taacatcgtg ggtcactaca 2100gggagaaaat ccaggtcatg cagttcctgg cccatcaact gtattgggcc ttttggatat 2160gctgaacgca gaagaaaggg tggaaatcaa ccctctcctg tctgcctctg ggtccctcct 2220ctcacctctc cctcgatcat atttcccctt ggacacttgg ttagacgcct tccaggtcag 2280gatgcacatt tctggattgt ggttccatgc agggttgggg cattatgggt tcttccccca 2340cttcccctcc aagaccctgt gttcatttgg tgttcctgga agcaggtgcg acaacatgtg 2400aggcattcgg ggaagctcga catgtgccac acagtgactt ggccccagac gcatagactg 2460aggtataaag acaagtatga atattactct caaaatcttt gtataaataa atatttttgg 2520ggcatcctg 2529

What is claimed is:
 1. A method to identify agonist ligands ofprogesterone receptors, comprising: a. contacting a progesteronereceptor with a putative agonist ligand, wherein said progesteronereceptor is selected from the group consisting of progesterone receptorA (PR-A) and progesterone receptor B (PR-B), under conditions wherein,in the absence of said putative agonist ligand, said progesteronereceptor is not activated; b. detecting expression of at least one genethat is regulated by said progesterone receptor when said progesteronereceptor is activated, said at least one gene being selected from thegroup consisting of: i. at least one gene that is selectivelyupregulated by PR-A chosen from a gene in Table 1; ii. at least one genethat is selectively downregulated by PR-A chosen from a gene in Table 2;iii. at least one gene that is selectively upregulated by PR-B chosenfrom a gene in Table 3; iv. at least one gene that is selectivelydownregulated by PR-B chosen from a gene in Table 4; v. at least onegene that is upregulated or downregulated by both PR-A and PR-B chosenfrom a gene in Table 5; vi. at least one gene that is reciprocallyregulated by PR-A and PR-B chosen from a gene in Table 6; and, vii. atleast one gene that is regulated by one of said PR-A or said PR-B,wherein regulation of said gene is altered when the other of said PR-Aor PR-B is expressed by the same cell, chosen from a gene in Table 7;and, c. comparing the expression of said at least one gene in thepresence and in the absence of said putative agonist ligand, whereindetection of regulation of the expression of said at least one gene inthe manner associated with activation of said progesterone receptor asset forth in (b) indicates that said putative agonist ligand is aprogesterone receptor agonist.
 2. The method of claim 1, wherein saidprogesterone receptor is PR-A.
 3. The method of claim 1, wherein saidprogesterone receptor is PR-B.
 4. The method of claim 1, wherein saidprogesterone receptor comprises both PR-A and PR-B.
 5. The method ofclaim 1, wherein detection of upregulation of expression of at least onegene chosen from a gene in Table 1, or detection of downregulation of atleast one gene chosen from a gene in Table 2, in the presence of saidputative agonist ligand, indicates that said putative agonist ligand isa selective agonist of PR-A.
 6. The method of claim 1, wherein detectionof upregulation of expression of at least one gene chosen from a gene inTable 3, or detection of downregulation of at least one gene chosen froma gene in Table 4, in the presence of said putative agonist ligand,indicates that said putative agonist ligand is a selective agonist ofPR-B.
 7. The method of claim 1, wherein said step (b) of detectingcomprises detecting expression of at least five genes from any one ormore of said Tables 1-7.
 8. The method of claim 1, wherein said step (b)of detecting comprises detecting expression of at least ten genes fromany one or more of said Tables 1-7.
 9. The method of claim 1, whereinsaid step (b) of detecting comprises detecting expression of at least 15genes from any one or more of said Tables 1-7.
 10. The method of claim1, further comprising a step of detecting expression of at least onegene chosen from the genes in Table
 8. 11. The method of claim 1,wherein said progesterone receptor is expressed by a cell.
 12. Themethod of claim 11, wherein said progesterone receptor is endogenouslyexpressed by said cell.
 13. The method of claim 11, wherein saidprogesterone receptor is recombinantly expressed by said cell.
 14. Themethod of claim 11, wherein said cell is part of a tissue from a testanimal.
 15. The method of claim 14, wherein said step of contacting isperformed by administration of said putative agonist ligand to said testanimal or to said tissue of said test animal.
 16. The method of claim 1,wherein expression of said at least one gene is detected by measuringamounts of transcripts of said at least one gene before and aftercontact of said progesterone receptor with said putative agonist ligand.17. The method of claim 1, wherein expression of said at least one geneis detected by detecting hybridization of at least a portion of said atleast one gene or a transcript thereof to a nucleic acid moleculecomprising a portion of said at least one gene or a transcript thereofin a nucleic acid array.
 18. The method of claim 1, wherein expressionof said at least one gene is detected by measuring expression of areporter gene that is operatively linked to at least the regulatoryregion of said at least one gene.
 19. The method of claim 1, whereinexpression of said at least one gene is detected by detecting theproduction of a protein encoded by said at least one gene.
 20. Themethod of claim 1, wherein said putative agonist ligand is a product ofrational drug design.
 21. The method of claim 1, comprising, in step(b), detecting expression of: 11-beta-hydroxysteroid dehydrogenase type2, tissue factor gene, PCI gene (plasminogen activator inhibitor 3),MAD-3 Ikβ-alpha, Niemann-Pick C disease (NPC1), platelet-typephosphofructokinase, phenylethanolamine n-methyltransferase (PNMT),transforming growth factor-beta 3 (TGF-beta3), Monocyte ChemotacticProtein 1, delta sleep inducing peptide (related to TSC-22), andestrogen receptor-related protein (hERRa1).
 22. The method of claim 1,comprising, in step (b), detecting expression of: growth arrest-specificprotein (gas6), tissue factor gene, NF-IL6-beta (C/EBPbeta), PCI gene(plasminogen activator inhibitor), Stat5A, calcium-binding proteinS100P, MSX-2, lipocortin II (calpactin I), selenium-binding protein(hSBP), and bullous pemphigoid antigen (plakin family).
 23. The methodof claim 1, comprising, in step (b), detecting expression ofphenylethanolamine n-methyltransferase (PNMT) adrenal medulla.
 24. Themethod of claim 1, comprising, in step (b), detecting expression ofproteasome-like subunit MECL-1.
 25. The method of claim 1, comprising,in step (b), detecting expression of: growth arrest-specific protein andtissue factor gene.
 26. A method to identify antagonists of progesteronereceptors, comprising: a. contacting a progesterone receptor with aputative antagonist ligand, wherein said progesterone receptor isselected from the group consisting of progesterone receptor A (PR-A) andprogesterone receptor B (PR-B), under conditions wherein, in the absenceof said putative antagonist ligand, said progesterone receptor isactivated; b. detecting expression of at least one gene that isregulated by said progesterone receptor when said progesterone receptoris activated, said at least one gene being selected from the groupconsisting of: i. at least one gene that is selectively upregulated byPR-A chosen from a gene in Table 1; ii. at least one gene that isselectively downregulated by PR-A chosen from a gene in Table 2; iii. atleast one gene that is selectively upregulated by PR-B chosen from agene in Table 3; iv. at least one gene that is selectively downregulatedby PR-B chosen from a gene in Table 4; v. at least one gene that isupregulated or downregulated by both PR-A and PR-B chosen from a gene inTable 5; vi. at least one gene that is reciprocally regulated by PR-Aand PR-B chosen from a gene in Table 6; and, vii. at least one gene thatis regulated by one of said PR-A or said PR-B, wherein regulation ofsaid gene is altered when the other of said PR-A or PR-B is expressed bythe same cell, chosen from a gene in Table 7; and, c. comparing theexpression of said at least one gene in the presence and in the absenceof said putative antagonist ligand, wherein detection of inhibition orreversal of the regulation of expression of said at least one gene ascompared to the regulation of expression of said at least one gene inthe manner associated with activation of said progesterone receptor asset forth in (b), indicates that said putative antagonist ligand is aprogesterone receptor antagonist.
 27. The method of claim 26, whereinsaid progesterone receptor is PR-A.
 28. The method of claim 26, whereinsaid progesterone receptor is PR-B.
 29. The method of claim 26, whereinsaid progesterone receptor comprises both PR-A and PR-B.
 30. The methodof claim 26, wherein said progesterone receptor is activated bycontacting said receptor with a compound that activates said receptor,said step of contacting being performed prior to, simultaneously with,or after said step of contacting of (a).
 31. The method of claim 26,wherein detection of inhibition of expression or downregulatedexpression of at least one gene chosen from a gene in Table 1 in thepresence of said putative antagonist ligand as compared to theexpression of said at least one gene in the presence of said compoundthat activates said progesterone receptor, or detection of inhibition ofexpression or upregulation of expression of at least one gene chosenfrom a gene in Table 2 in the presence of said putative antagonistligand as compared to the expression of said at least one gene in thepresence of said compound that activates said progesterone receptor,indicates that said putative antagonist ligand is a selective antagonistof PR-A.
 32. The method of claim 26, wherein detection of inhibition ofexpression or downregulation of expression of at least one gene chosenfrom a gene in Table 3 in the presence of said putative antagonistligand as compared to the expression of said at least one gene in thepresence of said compound that activates said progesterone receptor, ordetection of inhibition of expression or upregulation of expression ofat least one gene chosen from a gene in Table 4, in the presence of saidputative antagonist ligand as compared to the expression of said atleast one gene in the presence of said compound that activates saidprogesterone receptor, indicates that said putative antagonist ligand isa selective antagonist of PR-B.
 33. The method of claim 26, wherein saidstep (b) of detecting comprises detecting expression of at least fivegenes from any one or more of said Tables 1-7.
 34. The method of claim26, wherein said step (b) of detecting comprises detecting expression ofat least ten genes from any one or more of said Tables 1-7.
 35. Themethod of claim 26, wherein said step (b) of detecting comprisesdetecting expression of at least 15 genes from any one or more of saidTables 1-7.
 36. The method of claim 26, further comprising a step ofdetecting expression of at least one gene chosen from the genes in Table8.
 37. The method of claim 26, wherein said progesterone receptor isexpressed by a cell.
 38. The method of claim 37, wherein saidprogesterone receptor is endogenously expressed by said cell.
 39. Themethod of claim 37, wherein said progesterone receptor is recombinantlyexpressed by said cell.
 40. The method of claim 37, wherein said cell ispart of a tissue from a test animal.
 41. The method of claim 40, whereinsaid step of contacting is performed by administration of said putativeagonist ligand to said test animal.
 42. The method of claim 26, whereinexpression of said at least one gene is detected by measuring amounts oftranscripts of said at least one gene before and after contact of saidprogesterone receptor with said putative agonist ligand.
 43. The methodof claim 26, wherein expression of said at least one gene is detected bydetecting hybridization of at least a portion of said at least one geneor a transcript thereof to a nucleic acid molecule comprising a portionof said at least one gene or a transcript thereof in a nucleic acidarray.
 44. The method of claim 26, wherein expression of said at leastone gene is detected by measuring expression of a reporter gene that isoperatively linked to at least the regulatory region of said at leastone gene.
 45. The method of claim 26, wherein expression of said atleast one gene is detected by detecting the production of a proteinencoded by said at least one gene.
 46. The method of claim 26, whereinsaid putative antagonist ligand is a product of rational drug design.47. A method to identify isoform-specific agonists of progesteronereceptors, comprising: a. contacting a progesterone receptor with aputative agonist ligand, wherein said progesterone receptor is selectedfrom the group consisting of progesterone receptor A (PR-A) andprogesterone receptor B (PR-B), under conditions wherein in the absenceof said putative agonist ligand, said progesterone receptor is notactivated; b. detecting expression of at least one gene that isregulated by said progesterone receptor when said progesterone receptoris activated, said at least one gene being selected from the groupconsisting of: i. at least one gene that is exclusively upregulated ordownregulated by PR-A, chosen from a Table selected from the groupconsisting of Table 1 and Table 2; and, ii. at least one gene that isexclusively upregulated or downregulated by PR-B chosen from a Tableselected from the group consisting of Table 3 and Table 4; and, c.comparing the expression of said at least one gene in the presence andin the absence of said putative agonist ligand, wherein detection ofregulation of the expression of said at least one gene in the mannerassociated with activation of said progesterone receptor as set forth in(b)(i) but not (b)(ii), indicates that said putative agonist ligand is aPR-A-specific agonist, and wherein detection of regulation of theexpression of said at least one gene in the manner associated withactivation of said progesterone receptor as set forth in (b)(ii) but not(b)(i), indicates that said putative agonist ligand is a PR-B-specificagonist.
 48. The method of claim 47, wherein said progesterone receptorcomprises both PR-A and PR-B.
 49. The method of claim 47, wherein saidstep (b) of detecting comprises detecting expression of at least fivegenes from any one or more of said Tables 1-4.
 50. The method of claim47, wherein said step (b) of detecting comprises detecting expression ofat least ten genes from any one or more of said Tables 1-4.
 51. Themethod of claim 47, wherein said step (b) of detecting comprisesdetecting expression of at least 15 genes from any one or more of saidTables 1-4.
 52. A method to identify isoform-specific antagonists ofprogesterone receptors, comprising: a. contacting a progesteronereceptor with a putative antagonist ligand, wherein said progesteronereceptor is selected from the group consisting of progesterone receptorA (PR-A) and progesterone receptor B (PR-B), under conditions wherein,in the absence of said putative antagonist ligand, said progesteronereceptor is activated; b. detecting expression of at least one gene thatis regulated by said progesterone receptor when said progesteronereceptor is activated, said at least one gene being selected from thegroup consisting of: i. at least one gene that is exclusivelyupregulated or downregulated by PR-A, chosen from a Table selected fromthe group consisting of Table 1 and Table 2; and, ii. at least one genethat is exclusively upregulated or downregulated by PR-B chosen from aTable selected from the group consisting of Table 3 and Table 4; and, c.comparing the expression of said at least one gene in the presence andin the absence of said putative antagonist ligand, wherein, in thepresence of said putative antagonist ligand, detection of inhibition orreversal of the regulation of expression of said at least one gene ascompared to the regulation of expression of said at least one gene inthe manner associated with activation of said progesterone receptor asset forth in (b)(i) but not (b)(ii), indicates that said putativeantagonist ligand is a PR-A-specific antagonist, and wherein, in thepresence of said putative antagonist ligand, detection of inhibition orreversal of the regulation of expression of said at least one gene ascompared to the regulation of the expression of said at least one genein the manner associated with activation of said progesterone receptoras set forth in (b)(ii) but not (b)(i), indicates that said putativeantagonist ligand is a PR-B-specific antagonist.
 53. The method of claim52, wherein said progesterone receptor comprises both PR-A and PR-B. 54.The method of claim 52, wherein said step (b) of detecting comprisesdetecting expression of at least five genes from any one or more of saidTables 1-4.
 55. The method of claim 52, wherein said step (b) ofdetecting comprises detecting expression of at least ten genes from anyone or more of said Tables 1-4.
 56. The method of claim 52, wherein saidstep (b) of detecting comprises detecting expression of at least 15genes from any one or more of said Tables 1-4.
 57. A method to identifya tissue-specific agonist of a progesterone receptor, comprising: a.providing an expression profile for at least one gene that is known tobe regulated by a progesterone receptor in both a first and secondtissue type when said progesterone receptor is activated, wherein saidat least one gene is chosen from the genes in any one or more of thegenes in Tables 1-7; b. contacting a progesterone receptor expressed bya first tissue type with a putative agonist ligand, wherein saidprogesterone receptor is selected from the group consisting ofprogesterone receptor A (PR-A) and progesterone receptor B (PR-B), underconditions wherein, in the absence of said putative agonist ligand, saidprogesterone receptor is not activated; c. contacting a progesteronereceptor expressed by a second tissue type with said putative agonistligand under conditions wherein, in the absence of said putative agonistligand, said progesterone receptor is not activated, wherein saidprogesterone receptor is the same isoform as the progesterone receptorcontacted in (b); d. detecting expression of said at least one gene from(a); e. comparing the expression of said at least one gene in thepresence and in the absence of said putative agonist ligand in each ofsaid first and second tissue types, wherein detection of regulation ofthe expression of said at least one gene in one of said first or secondtissue types in the manner associated with activation of saidprogesterone receptor as set forth in said expression profile of (a),and detection of inhibition of regulation or no regulation of said atleast one gene in the other of said first or second tissue types, ascompared to the expression of said at least one gene associated withactivation of said progesterone receptor as set forth in said expressionprofile of (a), indicates that said putative agonist ligand is atissue-specific progesterone receptor agonist.
 58. The method of claim57, wherein said first tissue type is breast, and wherein said at leastone gene is selected from the group consisting of: i. at least one genethat is selectively upregulated by PR-A chosen from a gene in Table 1;ii. at least one gene that is selectively downregulated by PR-A chosenfrom a gene in Table 2; iii. at least one gene that is selectivelyupregulated by PR-B chosen from a gene in Table 3; iv. at least one genethat is selectively downregulated by PR-B chosen from a gene in Table 4;v. at least one gene that is upregulated or downregulated by both PR-Aand PR-B chosen from a gene in Table 5; vi. at least one gene that isreciprocally regulated by PR-A and PR-B chosen from a gene in Table 6;and, vii. at least one gene that is regulated by one of said PR-A orsaid PR-B, wherein regulation of said gene is altered when the other ofsaid PR-A or PR-B is expressed by the same cell, chosen from a gene inTable
 7. 59. The method of claim 57, wherein said second tissue type isselected from the group consisting of breast, uterus, bone, cartilage,cardiovascular tissues, heart, lung, brain, meninges, pituitary, ovary,oocyte, corpus luteum, oviduct, fallopian tubes, T lymphocytes, Blymphocytes, thymocytes, salivary gland, placenta, skin, gut, pancreas,liver, testis, epididymis, bladder, urinary tract, eye, and teeth. 60.The method of claim 57, wherein said first tissue type is anon-malignant tissue and wherein said second tissue type is a malignanttissue from the same tissue source as the first tissue type.
 61. Themethod of claim 60, wherein said tissue source is breast tissue.
 62. Themethod of claim 57, wherein said first tissue type is a normal tissueand wherein said second tissue type is a non-malignant, abnormal tissue.63. The method of claim 57, wherein said expression profile of genesregulated by a progesterone receptor in said first or second tissue typeis provided by a method comprising: a. providing a first cell of aselected tissue type that expresses a progesterone receptor A (PR-A) andnot a progesterone receptor B (PR-B) and a second cell of the sametissue type that expresses PR-B and not PR-A; b. stimulating saidprogesterone receptors in (a) by contacting said first and second cellswith a progesterone receptor stimulatory ligand; c. detecting expressionof genes by said first and second cells in the presence of saidstimulatory ligand and in the absence of said stimulatory ligand,wherein a difference in the expression of a gene in the presence of saidstimulatory ligand as compared to in the absence of said stimulatoryligand, indicates that said gene is regulated by said progesteronereceptor in said selected tissue type.
 64. A method to identify atissue-specific antagonist of a progesterone receptor, comprising: a.providing an expression profile for at least one gene that is known tobe regulated by a progesterone receptor in both a first and secondtissue type when said progesterone receptor is activated, wherein saidat least one gene is chosen from the genes in any one or more of thegenes in Tables 1-7; b. contacting a progesterone receptor expressed bya first tissue type with a putative antagonist ligand, wherein saidprogesterone receptor is selected from the group consisting ofprogesterone receptor A (PR-A) and progesterone receptor B (PR-B), underconditions wherein, in the absence of said putative antagonist ligand,said progesterone receptor is activated; c. contacting a progesteronereceptor expressed by a second tissue type with said putative antagonistligand, wherein said progesterone receptor is selected from the groupconsisting of progesterone receptor A (PR-A) and progesterone receptor B(PR-B), under conditions wherein, in the absence of said putativeantagonist ligand, said progesterone receptor is activated; d. detectingexpression of said at least one gene from (a); and, e. comparing theexpression of said at least one gene in the presence and in the absenceof said putative antagonist ligand in each of said first and secondtissue types, wherein detection of regulation of the expression of saidat least one gene in one of said first or second tissue types in themanner associated with activation of said progesterone receptor as setforth in said expression profile of (a) in the presence of said putativeantagonist ligand, and detection of inhibition or reversal of regulationof expression of said at least one gene in the other of said first orsecond tissue types in the presence of said putative antagonist ligand,as compared to the expression of said at least one gene associated withactivation of said progesterone receptor as set forth in said expressionprofile of (a), indicates that said putative antagonist ligand is atissue-specific progesterone receptor antagonist.
 65. The method ofclaim 64, wherein said first tissue type is breast, and wherein said atleast one gene is selected from the group consisting of: i. at least onegene that is selectively upregulated by PR-A chosen from a gene in Table1; ii. at least one gene that is selectively downregulated by PR-Achosen from a gene in Table 2; iii. at least one gene that isselectively upregulated by PR-B chosen from a gene in Table 3; iv. atleast one gene that is selectively downregulated by PR-B chosen from agene in Table 4; v. at least one gene that is upregulated ordownregulated by both PR-A and PR-B chosen from a gene in Table 5; vi.at least one gene that is reciprocally regulated by PR-A and PR-B chosenfrom a gene in Table 6; and, vii. at least one gene that is regulated byone of said PR-A or said PR-B, wherein regulation of said gene isaltered when the other of said PR-A or PR-B is expressed by the samecell, chosen from a gene in Table
 7. 66. The method of claim 64, whereinsaid second tissue type is selected from the group consisting of breast,uterus, bone, cartilage, cardiovascular tissues, heart, lung, brain,meninges, pituitary, ovary, oocyte, corpus luteum, oviduct, fallopiantubes, T lymphocytes, B lymphocytes, thymocytes, salivary gland,placenta, skin, gut, pancreas, liver, testis, epididymis, bladder,urinary tract, eye, and teeth.
 67. The method of claim 64, wherein saidfirst tissue type is a non-malignant tissue and wherein said secondtissue type is a malignant tissue from the same tissue source as thefirst tissue type.
 68. The method of claim 67, wherein said tissuesource is breast tissue.
 69. A method to identify a tissue-specificagonist of a progesterone receptor, comprising: a. providing anexpression profile for at least one gene that is known to be regulatedby a progesterone receptor in a first tissue type but not a secondtissue type when said progesterone receptor is activated, wherein saidat least one gene is chosen from the genes in any one or more of thegenes in Tables 1-7; b. contacting a progesterone receptor expressed bysaid first tissue type with a putative agonist ligand, wherein saidprogesterone receptor is selected from the group consisting ofprogesterone receptor A (PR-A) and progesterone receptor B (PR-B), underconditions wherein, in the absence of said putative agonist ligand, saidprogesterone receptor is not activated; c. detecting expression of saidat least one gene from (a); d. comparing the expression of said at leastone gene in the presence and in the absence of said putative agonistligand in said first tissue type, wherein detection of regulation of theexpression of said at least one gene in said first tissue type in themanner associated with activation of said progesterone receptor as setforth in said expression profile of (a) indicates that said putativeagonist ligand is a tissue-specific progesterone receptor agonist forsaid first tissue type.
 70. A method to identify a tissue-specificantagonist of a progesterone receptor, comprising: a. providing anexpression profile for at least one gene that is known to be regulatedby a progesterone receptor in a first but not in a second tissue typewhen said progesterone receptor is activated, wherein said at least onegene is chosen from the genes in any one or more of the genes in Tables1-7; b. contacting a progesterone receptor expressed by a first tissuetype with a putative antagonist ligand, wherein said progesteronereceptor is selected from the group consisting of progesterone receptorA (PR-A) and progesterone receptor B (PR-B), under conditions wherein,in the absence of said putative antagonist ligand, said progesteronereceptor is activated; c. detecting expression of said at least one genefrom (a); and, d. comparing the expression of said at least one gene inthe presence and in the absence of said putative antagonist ligand insaid first tissue type, wherein detection of inhibition or reversal ofregulation of expression of said at least one gene in said first tissuetype in the presence of said putative antagonist ligand, as compared tothe expression of said at least one gene associated with activation ofsaid progesterone receptor as set forth in said expression profile of(a), indicates that said putative antagonist ligand is a tissue-specificprogesterone receptor antagonist of said first tissue type.
 71. A methodto determine the profile of genes regulated by progesterone receptors ina breast tumor sample, comprising: a. obtaining from a patient a breasttumor sample; b. detecting expression of at least one gene in saidbreast tumor sample that is regulated by a progesterone receptor whensaid progesterone receptor is activated, said at least one gene beingselected from the group consisting of: i. at least one gene that isselectively upregulated by PR-A chosen from a gene in Table 9; ii. atleast one gene that is selectively downregulated by PR-A chosen from agene in Table 10; iii. at least one gene that is selectively upregulatedby PR-B chosen from a gene in Table 11; iv. at least one gene that isselectively downregulated by PR-B chosen from a gene in Table 12; v. atleast one gene that is upregulated or downregulated by both PR-A andPR-B chosen from a gene in Table 13; vi. at least one gene that isreciprocally regulated by PR-A and PR-B chosen from a gene in Table 14;and, vii. at least one gene that is regulated by one of said PR-A orsaid PR-B, wherein regulation of said gene is altered when the other ofsaid PR-A or PR-B is expressed by the same cell, chosen from a gene inTable 15; and, c. producing a profile of genes for said tumor samplethat are regulated selectively by PR-A, selectively by PR-B, or by bothPR-A and PR-B.
 72. A plurality of polynucleotides for the detection ofthe expression of genes regulated by progesterone receptors in breasttissue; wherein said plurality of polynucleotides consists ofpolynucleotide probes that are complementary to RNA transcripts, ornucleotides derived therefrom, of genes that are regulated byprogesterone receptors; and wherein said plurality of polynucleotidescomprises polynucleotide probes that are complementary to RNAtranscripts, or nucleotides derived therefrom, of genes selected fromthe group consisting of: a. at least one gene that is selectivelyupregulated by PR-A chosen from a gene in Table 1; b. at least one genethat is selectively downregulated by PR-A chosen from a gene in Table 2;c. at least one gene that is selectively upregulated by PR-B chosen froma gene in Table 3; d. at least one gene that is selectivelydownregulated by PR-B chosen from a gene in Table 4; e. at least onegene that is upregulated or downregulated by both PR-A and PR-B chosenfrom a gene in Table 5; f. at least one gene that is reciprocallyregulated by PR-A and PR-B chosen from a gene in Table 6; and, g. atleast one gene that is regulated by one of said PR-A or said PR-B,wherein regulation of said gene is altered when the other of said PR-Aor PR-B is expressed by the same cell, chosen from a gene in Table 7.73. The plurality of polynucleotides of claim 72, wherein saidpolynucleotide probes are immobilized on a substrate.
 74. The pluralityof polynucleotides of claim 72, wherein said polynucleotide probes arehybridizable array elements in a microarray.
 75. The plurality ofpolynucleotides of claim 72, wherein said polynucleotide probes areconjugated to detectable markers.
 76. The plurality of polynucleotidesof claim 72, wherein said plurality of polynucleotides further comprisesat least one polynucleotide probe that is complementary to RNAtranscripts, or nucleotides derived therefrom, of at least one genechosen from the genes in Table
 8. 77. A plurality of antibodies, orantigen binding fragments thereof, for the detection of the expressionof genes regulated by progesterone receptors in breast tissue; whereinsaid plurality of antibodies, or antigen binding fragments thereof,consists of antibodies, or antigen binding fragments thereof, thatselectively bind to proteins encoded by genes that are regulated byprogesterone receptors; and wherein said plurality of antibodies, orantigen binding fragments thereof, comprises antibodies, or antigenbinding fragments thereof, that selectively bind to proteins encoded bygenes selected from the group consisting of: a. genes that areselectively upregulated by PR-A chosen from genes in Table 1; b. genesthat are selectively downregulated by PR-A chosen from genes in Table 2;c. genes that are selectively upregulated by PR-B chosen from genes inTable 3; d. genes that are selectively downregulated by PR-B chosen fromgenes in Table 4; e. genes that are upregulated or downregulated by bothPR-A and PR-B chosen from genes in Table 5; f. genes that arereciprocally regulated by PR-A and PR-B chosen from genes in Table 6;and, g. genes that are regulated by one of said PR-A or said PR-B,wherein regulation of said gene is altered when the other of said PR-Aor PR-B is expressed by the same cell, chosen from genes in Table
 7. 78.The plurality of antibodies, or antigen binding fragments thereof, ofclaim 77, wherein said plurality of antibodies, or antigen bindingfragments thereof, further comprises at least one antibody, or anantigen binding fragment thereof, that selectively binds to a proteinencoded by a gene chosen from the genes in Table
 8. 79. A method toidentify genes that are regulated by a progesterone receptor in two ormore tissue types, comprising: a. activating a progesterone receptor intwo or more tissue types that express said progesterone receptor; b.detecting expression of at least one gene said two or more tissue types,said at least one gene being chosen from a gene in any one or more ofTables 1-7, and, c. identifying genes that are regulated by saidprogesterone receptor in each of said two or more tissue types.
 80. Themethod of claim 79, further comprising detecting whether said genes areregulated selectively by PR-A, selectively by PR-B, or by both PR-A andPR-B.
 81. A method to regulate the expression of a gene selected fromthe group consisting of any one or more of said genes in Tables 1-7,wherein said method comprises administering to a cell that expresses aprogesterone receptor a compound selected from the group consisting of:progesterone, aprogestin, and an antiprogestin, wherein said compound iseffective to regulate the expression of said gene.
 82. The method ofclaim 81, wherein said gene is selected from the group consisting of:growth arrest-specific protein (gas6), NF-IL6-beta (C/EBPbeta),calcium-binding protein S100P, MSX-2, selenium-binding protein (hSBP),and bullous pemphigoid antigen (plakin family).
 83. The method of claim81, wherein said cell that expresses a progesterone receptor is in thebreast tissue of a patient that has, or is at risk of developing, breastcancer.